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PRIMER EXTENSION USING A MIXTURE OF LABELLED AND 
UNLABELLED NUCLEOTIDES 

CROSS-REFERENCES TO RELATED APPLICATIONS 
5 This application is a continuation-in-part of U.S. Application No, 09/585,768, 

filed June 2, 2000, which is incorporated herein in its entirety for all purposes.. 

FIELD OF INVENTION 
The present invention relates to the field of molecular genetics, particularly flie 
• identification and detection of certain nucleotide sequences. 

10 

BACKGROUND OF THE INVENTION 
The nucleic acids conq)rising the genome of an organism contain the genetic 
information for that organism. The translation or expression of these nucleic acids generates 
proteins that function in many diverse ways within the organism. Even minute changes in a 

15 nucleotide sequence, including single base pair substitutions, can have a significant effect in 
the quality or quantity of a protein. Single nucleotide changes are referred to as single 
nucleotide polymorphisms or simply SNPs, and the site at which the SNP occurs is typically 
referred to as a polymorphic site. 

Many SNPs, as well as larger nucleic acid alterations, can affect the phenotype 

20 of the organism, and in some instances can result in the onset of disease. For example, 
diseases associated with SNPs include: sickle cell anemia, P-thalassemias, diabetes, cystic 
fibrosis, hyperUpoproteinemia, a wide variety of autoimmune diseases, and the formation of 
oncogenes. In addition to causing or affectuig disease states, point mutations can cause 
altered pathogenicity and resistance to therapeutics that target certain microorganisms. 

25 The ability to detect specific nucleotide alterations or mutations in DNA 

sequences has a number of medical and non-medical utilities. For example, methods enable 
of identifying nucleotide alterations provide a means for screening and diagnosing many 
common diseases that are associated with SNPs. Methods that can quickly identify such 
changes or mutations are also valuable in taking prophylactic measures, assessing the 

30 propensity for disease, and in patient counseling and education. As for non-medical 

1 
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qiplications, such methods have value in the detection of microorganisms, resolvmg paternity 
disputes and in forensic analysis to identify peipetialors of crimes. 

Various methods have been developed to obtain sequence information for 
variants sites. Such mefhods include hybridization reactions betwem a target nucleic acid 
5 aod allele-specific oUgonucleotide (ASO) probes (see, e.g., European Patent PubUcadons EP- 
237362 andHP-329311), allele specific amplification (see, e.g., U.S. Pat. Nos. 5,521,301; 
5,639,61 1 ; and 5,981,176), mini-sequencing methods, quantitative RT-PCR methods (e,g.^ 
the so-called 'TaqMan assays"; see, U.S. Pat Nos. 5,210,015 to Gelfend, 5,538,848 to 
Livak, et aL, and 5,863,736 to Haaland, as well as Held, C A., et aL, Genome Research, 
10 6:986-994 (1996); Gibson, U.E.M, et al. Genome Research 6:995-1001 (1996); Holland, P. 
M., et al, Proc, Natl Acad. Set. USA 88:7276-7280 (1991); and livak, K.J., et al, PGR 
Methods and Applications 357-362 (1995)), and various single base pair extension (SBPE) 
assays. 

A number of SPBE assays have been developed, but tiie general ^proach is 

15 quite similar. Typically, these assays involve hybridizing a primer that is complementary to a 
target nucleic acid such that the 3' end of the primer is unmediately 5' of the variant site or is 
adjacent ther^o. Extension is conducted in die presence of one or more labelled non- 
extendible nucleotides that are complementary to the nucIeotide(s) that occupy the variant 
site and a polymerase. The non-extendible nucleotide is a nucleotide analog that prevents 

20 furth^ extension by the polymerase once incoiporated into the primer. If the added non- 
extendible nucleotide(s) is(are) complementary to the nucleotide at the variant site, then a 
labelled non-extendible nucleotide is incorporated onto the 3' end of the primer to generate a 
labelled extmsion product Hence, extended primers provide an indication of which 
nucleotide is present at the variant site of a target nucleic acid. Such methods and related 

25 methods are discussed, for example, in U.S. Patent Nos. 5,846,710; 6,004,744; 5,888,819; 
5,856,092; and 5,710,028; and in PCT publication WO 92/16657. 

SBPE methods involve the addition of labelled nucleotides of hi^ purity 
which contain low levels of unlabelled nucleotide. Such additions are deemed necessary to 
obtain acceptable signal-to-iK>ise ratios. The high concentration of label, however, can 

3 0 interfere with the fidelity and rate at which polymerases incorporate non-extendible 

nucleotide into the primer. Consequmtly, SBPE methods typically are limited to using 
polymerases diat are less error prone than other polymerases. Typically, the less error prone 
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enzyme is anon-fheimostable enzyme; utilization of tiiis ^e of enzyme can negatively 
affect the signal-to-noise ratio, since the non-themiostable polymerases cannot be used in 
cycling m^ods to increase the amount of labelled products. In addition, it is often necessary 
to use a specially modified polymerase to efScientiy incorporate a labelled nucleotide analog; 
5 fisdhire to use such enzymes can lead to serious levels of misincorporation (see e,g.. Reeves 
and Fuller, Nature 376: 796-797 (1995)). Further, because SBPE methods are limited to 
methods involving hi^ concentrations of labelled nucleotides, the methods are quite 
expensive. 

10 SUMMARY OF THE INVENTION 

The present invention provides mdbods and kits for detecting and identifying 
the nucleotide present at a variant site of a target nucleic acid of interest. The meliiods and 
kits can be utilized in research, clinical and laboratory settings. In general, the methods 
involve conducting extension reactions in the presence of a mixtiire of labelled and 

15 unlabelled farms of one or more nucleotides to generate labelled extension products that are 
characteristic of the nucleotide occupying the variant site of a target nucleic acid. The 
detection of incoiporation of labelled nucleotide provides an indication of the identity of the 
nucleotide at the variant site since the incorporated nucleotide is complementary to the base 
at the site of variatioiL The methods can be used in conducting genotyping analyses and can 

20 be performed in multiplexing formats. The methods and kits have utility in diverse 
^plications including, for example, analyzing point mutations and single nucleotide 
polymorphisms, detection of pathogens, paternity disputes, prenatal testing and forensic 
investigations. 

Certain methods of the invention are methods for analyzing a site of variation 
25 in a target nucleic acid and involve contacting a sample of the polynucleotide with multiple 
copies of a primer whose 3' end hybridizes adjacent to but not including the site of variation 
in the presence of a mixture of labelled and unlabelled forms of a nucleotide. The unlabelled 
form is typically 30% to 95% of the total amount of the labelled and unlabelled forms on a- 
molar basis. The primer copies andt^et polynucleotide are cbntacted under cbiidi^ons in" 
30 which if the nucleotide is complemezi^ary to the base occupying the site of variation in the 
targ^ polynucleotide, at least one copy of the primer is extended by incorporation of the 
labelled nucleotide to form labelled extension product. The incorporation of the labeled 
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nucleotide into the primer is then detected; the idmtity of the base incorporated into the 
primer indicates which base occupies the site of variation in the target polynucleotide. The 
nucleotide provided as a mixture can be an extendible nucleotide such as the naturally 
occurring deoxynucleotides (e.g.^ dATP, dTTP, dCTP and dGTP), a non-extendible 
5 nucleotide (e.g., a dideo^iynucleotide or an arabinoside triphosphate) or a combination 
thereof. 

Utilizaiion of a mixture of labelled and unlabelled forms can reduce the 
incidence of misincorporation that occurs vrbesi substantially pure labelled nucleotides are 
used to conduct extension reactions with thermostable polymerases. Misincorporation with 

10 some labelled nucleotides can be sufficientiy problematic with certain prior art methods that 
extension reactions must be performed witii non-thmnostable polymerases that are less prone 
to misincorporate labelled nucleotides as compared to tiiermostable polymerases. The use of 
non'thermostable polymerases, however, prevents one fix>m conducting thermocycling 
reactions to increase extension product concentrations since such thennocycling reactions 

1 S require a thermostable polymerase. By using a mixture of labelled and unlabelled forms, in 
some instances the level of misincorporation is sufficiaatly reduced that thermostable and 
modified themiostable polymerases can be utilized, thus making certain methods of the 
invention amendable to thermocycling techniques to increase extension product 
concentration. 

20 Multiple different nucleotides can be included in the extension reaction 

mixtures in certain mettiods. For example, in conducting genotyping experiments of a 
biallelic variant site, the two nucleotides complementary to the bases potentially occupying 
the variant site of the target nucleic acid can be added, each nucleotide added as a mixture of 
labelled and unlabelled forms. Thus, certain methods of the invention include contacting a 

25 sample containing multiple copies of a target polynucleotide witii a plurality of copies of a 
primer whose 3' end hybridizes adjacent to but not induding the site of variation in the 
presence of a plurality of mixtures of labelled and unlabelled forms of a nucleotide, each 
mixture containing a different nucleotide. The unlabelled form added is typically 30% to 
95% of the combined amount of the labelled and unlabelled forms on a molar basis. The 

30 target polynucleotide, primers and multiple mixtures of nucleotides are contacted under 

conditions in which if the nucleotide of a mixture is complementary to the base occupying the 
site of variation in the target polynucleotide, at least one copy of the primer is extended by 



wo 01/94546 



incoiporation of the labelled nucleotide. Incorporation of labelled nucleotide(s) into the 
primers is then detected, the particular labelled nucleotide(s) incorporated being an indication 
of the nucleotides present at the variant site. 

The invention also provides midtiplexing methods in which multiple variant 

5 sites can be analyzed at the same time. Certain such methods involve conducting a plurality 
of template-dependent extension reactions with a plurality of different primers and one or 
more mixtures of labelled and unlabelled forms of a nucleotide. The different primers 
utilized hybridize adjacent different sites of variation on target polynucleotides and the 
different mixtures contain different nucleotides. The unlabelled form in each mixture is 

10 typically 30% to 90% of the combined amount of the labelled and unlabelled forms on a 
molar basis. Each extension reaction involves contacting a sample of the target 
polynucleotides with multiple copies of one of the different primers, wherein the ptim^ bears 
an optional label and the 3' end of the primer hybridize adjacent to but not including the site 
of variation in the target polynucleotide. The primer copies are exposed to one or more of tiie 

IS mixtures of labelled and unlabelled nucleotides, under conditions such that if the nucleotide 
of the mixture is complementary to the base occiq)ying the site of variation in the target 
polynucleotide, at least one copy of the primer is extended by incorporation of tiie labelled 
nucleotide. 

The various extension reactions generate a plurality of different exten^on 
20 products. Extension products are detected by detecting the incorporation of labelled 

nucleotides into the extended primers as an indication of the bases occupying the site of 
variation in the target polynucleotides. Extension products generated from different sites of 
variation can be distinguished on the basis of the optional label borne by the extaided primer 
and/or label borne by incorporated nucleotide. Tlie extension reactions can be conducted 
25 concurrently in a single reaction vessel or in separate reaction vessels. When the different 
sites of variation are different sites on tiie same target polynucleotide or differmt sites on 
different polynucleotides, the extension reactions can be performed concurrentiy in a sin^e 
reactiozL vessel. 

" " ^ IMfifaent" extension products can be distinguishedby using differentially 

30 labelled primers, diffeafentially labelled nucleotides and/or different sized primers for the 

different variant sites. When the same site of variation on different target nucleic acids is to 

be examined (e.g., the same SNP site for diSmnt individuals), ext^ion reactions for 

5 
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samples fiom differeat individuals are normally performed in separate reaction vessels. The 
extension products can ihea be combined and analyzed to detect iocoxporation of label into 
the extension products, la this instance also» extension products from different reaction 
vessels can be distinguished by utilizing differentially labelled primers» diSerentialLy labelled 

5 nucleotides and/or difiTereat sized primers for the different extension reactions. 

The invention also provides kits for conducting the methods of the inveodon* 
In general, such kits typically include a mixture of labelled and unlabelled nucleotideSy the 
unlabelled nucleotide being 30% to 95% of the combined amount of unlabelled and labelled 
forms of the nucleotide on a molar basis. The kits can also include a primer that hybridizes to 

10 a segment of the target nucleic acid» wh^in the 3 ' end of the primer hybridizes adjacent to 
the site of variation in the target nucleic acid and bears an optional label, and a polymerase 
that catalyzes template-dq)endent extension of a primer in the presence of the appropriate 
template and nucleotides. 



15 BRIEF DBSCRIPTrON OF THE DRAWINGS 

FIG. 1 illustmtes the major steps of certain extension methods of the present 

invention. 

FIG. 2 depicts a sequence of the amplima: containing the Chr22.53 SNP (A/G 
polymorphism). The SNP site is represented by the boldfaced '"R" in parentheses, indicating 
20 that the base at the variant site is A or G* The primer binding sites are shown, with die 

forward and reverse PGR prim^ binding sites in bold type and the extension primer binding 
site underlined. 

FIG. 3 A shows electropherograms as plots of fluorescence intensity versus 
time for genotyping e^^iments conducted for 7 different individuals at the Chr.22.53 SNP 

25 site. Experiments were conducted using 200 nM TAMRA-ddATP and 100 nM RllO-ddGTP. 
Signals j&om extension products bearing the label Rl 10 generated &om the G allele are 
indicated by traces labelled G; signals for extension products bearing the label TAMRA 
corresponding to the extension product generated &om the A allele are shown by traces 
labelled A. The goiotypes, as listed on the left side of each panel, were confirmed by 

30 sequencing the amplim^ directly. 

FIG. 3B provides electropherograms as plots of fluorescence intmsity versus 
time for genotyping esqieiimrats at die Chr.22.53 SNP site performed witii the same 7 
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individuals used to generate fee results sunimariz^ Unlike the results shown in 

FIG. 3 A, however, experiments were conducted wifli mixtures of labelled and unlabelled 
nucleotides. The genotypes were determined using a mixture of 1 5 nM Rl lO-ddGTP/85 nM 
ddOTP and a mixture of 30 nM TAMRA-ddATP/70 nM ddATP. Extension product bearing 

5 the RllO label is indicated by traces labeUedG and represents fee G^^^ Extension 

product bearing the TAMRA label is represented by traces labelled A and corresponds to fee 
A allele. The correct genotypes are listed on fee left side of fee panels. Coff?>arison of FIG. 
3 A and FIG. 3B indicate that equitable genotyping results are obtained in fee presence of a 
mixture of labelled and unlabelled nucleotides. 

10 FIG. 4A depicts electropherograms as plots of fluorescence intensity versus 

time for genotyping expmments conducted wife 7 individuals at fee Chr.22.S3 site. 
E3q>erim©ats were conducted using 100 nM R6G-ddATP and 100 nM Rl 10-ddGTP 
nucleotides were not mixtures of labelled and unlabelled forms). Signals fiom extension 
products bearing fee R6G label generated fix)m fee A allele are r^resented by traces labelled 

15 A. Extension products bearing fee Rl 10 label formed fix>m fee G allele are represented by 
traces labelled G. The correct genotypes, as confirmed by sequencing, are listed on fee left of 
fee panels. Note fee noisincoiporation of Rl 10-ddGTP into fee "AA" homozygotes. 

FIG. 4B presents electropherograms as plots of fluorescence intensity versus 
time for genotyping experiments conducted wife fee same 7 individuals as described in FIG. 

20 4A at fee Chr.22.53 SNP site. Experiments were perfcmned using a mixture of IS nM Rl 10- 
ddGTP/85 nM ddGTP and a mixture of 15 nM R6G-ddATP/85 nM ddATP, Extension 
products labelled wife RUG (corresponding to product fix>m the G allele) are indicated by fee 
traces labelled G. l^ctension products labelled wife R6G (corresponding to product firom fee 
A allele) are represented by fee traces labelled A. Note fee suppression of Rl 10-ddG 

25 misincorporation into fee ''AA" homozygotes. The correct genotypes are listed on fee left of 
fee traces. 

FIG. S shows fee sequ^ce of fee amplimer containing fee 
CYP4.2D6.G4268C SNP (a G/C polymorphic site) (Sachse et aU Am. J. Hum. Genet. 60:" 
284-295 (1997)). The SNP site is indicated by fee boldfiaced "S" indicatm^^tfiltrfe^v&iak 
30 site is occupied by G or C. The forward and reverse PGR primer binding sites are in bold 
type and fee binding site for fee detection primer is underlined. 



7 
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FIG. 6A presets electroph^grams as plots of fluorescence intensity versus 
tune for genotyping experiments conducted with 7 different individuals £it the 
CYP4.2D6.G4268C SNP site. E:3q>eriments were conducted using a mixture of labelled and 
unlabelled nucleotides, specifically 10 nM Rl 10-ddCTP/ 90 nM ddCTP and a mixture of 30 
5 nMTAMRA-ddGTP/70nMddGTP. Signals fiiom extension products incorporating Rl 10 
label and corresponding to die C allele are represmted by the traces labelled with a C. 
Extension products labelled wi^ TAMRA and corresponding to the G allele are represented 
by traces labelled G. The genotypes, as indicated in parentheses, were also confirmed by 
sequencing the amplimers directly. 

1 0 FIG. 6B depicts electropherograms as plots of fluorescmce intensity versus 

time for genotyping experiments conducted with saiiQ)les fit>m tbe same 7 individuals 
described in FIG. 6A at the CYP4.2D6.G4268C SNP site. Experiments were conducted with 
a mixture of 10 nM Rl lO-dCTP/90 nM OTP and mixtures of 30 nM TAMRA-ddGTP/70 nM 
ddGTP. Extension products incorporating Rl 10 and generated fiom the C allele are 

1 S represented by traces labelled C. Extension products labelled with TAMRA and generated 
fit>m the G allele are represented by traces labelled G. The correct g^otypes are as indicated 
in parentheses. A comparison of FIG. 6A and 6B demonstrates ttiat a deoxynucleotide can 
also be used in the primer extension reaction to obtain the correct genotypes. 

FIG. 7 depicts a sequence of the amplimer containing the 

20 CYP450.2D6.G1749C SJJP (G/C polymorphism). The SNP site is represented by the 

boldfaced "S" indicating that the base at the variant site is G or C. The primer binding sites 
are shown with the forward and reverse PGR primer binding sites in bold type and the primer 
binding site underlined. 

FIG. 8 A shows electropherograms as plots of fluorescence intensity versus 

25 time for genotyping experiments conducted for 7 different individuals at the 

CYP450.2D6.G1749C site. Experiments were conducted using 100 nM Rl 10-ddCTP and 
100 nM R6G-ddGTP. Signals fix)m extension products bearing the label Rl 10 generated 
fix>m the C allele are indicated by traces labelled C. Fluorescent signals for extension 
products bearing the R6G label generated from the G allele.are shown by traces labelled G. 

30 The gCTLOtypes, as listed on the left side of each panel, w^ confirmed by sequencing the 
amplimers directly. As can be seen in this figure, the genotype was correctiy identified for 
each of the 7 individuals. 
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FIG. 8B provides electropherograms as plots of fluordsceiit intensity versus 
time for genotyping ejqierimeats at the CYP450.2D6.G1749C site conducted with the same 7 
individuals used to generate the results summarized uiFIG« 8 A. Unlike the results shown in 
FIG. 8A, however, e!}q>eriments were conducted with a inixture of labelled and imlabelled 

5 nucleotides. Extensionreactions were conducted in the presence of a rnixture of 10 nM 
Rl lO-ddCTP/90 nM ddCTP and a mixture of 10 nM TAMRA-ddGTP/90 nM ddGTP. 
Extension product bearing the Rl 10 label is indicated by traces labelled C and represents 
esctension product formed from the C allele. Extension product bearing the R6G label is 
represented by traces labelled G and corresponds to the G allele. The correct genotypes are 

10 listed on the left side of the panels. A comparison of FIG. 8 A and FIG. 8B shows that 
equitable genotyping results are obtained in the presence of a mixture of labelled and 
unlabelled nucleotides. 



DETAILED DESCRIPTION 

IS L DefiTn'tinnfi 

A '^nucleic acid" is a deoxyribonucleotide or ribonucleotide polymer in either 
single or double-stranded form, including known analogs of natural nucleotides unless 
otherwise indicated. 

A •^polynucleotide" ref^ to a single- or double-stranded polymer of 
20 deoxyribonucleotide or ribonucleotide bases. 

An "oligonucleotide" is a single-stranded nucleic acid typically ranging in 
length from 2 to about 500 bases. Oligonucleotides are often synthetic but can also be 
produced from naturally occurring polynucleotides. Oligonucleotides can be prepared by any 
suitable method, including, for example, cloning and restriction of appropriate sequences and 
25 direct chemical synthesis by a method such as the phosphotriest^ melhod of Narang et aL» 
Meth. EnzymoL 68:90-99 (1979); the phosphodiester method of Brown et aL, Meth Enzymol. 
68:109-151 (1979); the diethylphosphoramidite method of Beaucage a/.. Tetrahedron Lett. 
22:1859-1862 (1981); and the solid support method described in U.S. Patent No. 4,458,066. 

" ^"A '*prnn©r'*'is a'siiig^^stranded oligonucleotide capable of acting as a potbt'bf 
30 initiation of template-directed DNA synthesis under appropriate conditions (/.e., in the 

presence of four different nucleoside triphosphates and an agent for polymerization, such as, 
DNA or RNA polymerase or reverse transcriptase) in an appropriate buffer and at a suitable 
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toEDperature. The ^ropiiate length of a primer d^ends on the intended use of the primer 
but typically ranges fix>m IS to 30 nucleotides. Short primer molecules generally require 
cooler temperatures to form sufficiently stable hybrid complexes with the teni^late. A primer 
need not reflect the exact sequence of the template but must be sufficiently complementary to 

5 hybridize with a ternplate. The temi 'primer site" or 'primer binding site" lefcacs to the 

segment of the target DNA to which a primer hybridizes. The term **primer paif * means a set 
of primers including a 5* *^stream prim^' tiiat hybridizes with the complCTaent of the 5' end 
of the DNA sequence to be amplified and a 3' "downstream primer*' that hybridizes with the 
3* end of the sequence to be anq>lified. 

10 A primer that is 'perfectly complementary" has a sequence fiilly 

complementary aoross the entire length of the primer and has no mismatches. The primer is 
typically perfectly complementary to a portion (subsequence) of a target sequence. A 
'^mismatch" refers to a site at which the nucleotide in the primer and the nucleotide in the 
target nucleic acid witibi which it is aligned are not complementary. 

15 The term "substantially complementary" means titat a primer is not perfectly 

complementary to its target sequmce; instead, the primer is only sufficiently complementary 
to hybridize selectively to its respective strand at the desired primer-binding site. 

Hybridizations are usually perfiormed under stringent conditions that allow for 
specific binding between an oligonucleotide and a target nucleic acid. A stringent condition 

20 is any suitable buffer concentration and tmiperature that aUow specific hybridization of the 
oligonucleotide to a complementary nucleic acid. The phrase '"hybridizing specifically to" 
and related phrases, refers to the bindings duplexing, or hybridizing of a molecule only to a 
particular nucleotide sequence und^ stringent conditions when that sequence is present in a 
complex mixture. See M. Kanehisa, Nucleic Adds Res.^ 12:203 (1984). Stringent conditions 

25 are sequence-dependent and will be different in different circumstances. Longer sequences 
hybridize ^edfically at bighCT temperatures. Generally, stringent conditions are selected to 
be about 5 lower than flie thermal melting point (Tm) for the specific sequence at a 
defined ionic strength and pH. Hie Tm is the temperature (under defined ionic strength, pH, 
and nucleic acid concentratian) at which 50% of the oligonucleotides complementaiy to the 

30 target sequence hybridize to the target sequence at equihbrium. Typically, stringent 
conditions will be those in which the salt concentration is less than about 1 .0 M Na ion, 
typically about 0.01 to 1 .0 M Na ion concrotration (or other salts) at pH 7.0 to 83 and the 
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temperature is at least about 30 ®C for short oligonucleotides ie.g., 10 to SO nucleotides) and 
at least about 60 "^C for longer probes (e,g., greater than 50 nucleotides). Stringent conditions 
caa also be achieved with the addition of destabilizing agents such as fonnamide. 

A "site of variation" or 'Variant site" when used with reference to a nucleic 

5 acid broadly refers to a site wherein the identity of nucleotide at the site varies between 
nucleic acids that odierwise have similar sequences. For double-stranded nucleic adds, fh& 
variant site includes the variable nucleotide on one strand and the conLq)lemeQtary nucleotide 
on the other strand. A variant site can be the site of a single nucleotidB polymorphism or the 
site of a somatic mutation, including a point mutation^ a deletion, an insertion, and a 

10 rearrangement, for example. 

A **polymorphic marker" or **polymoipliic site" is the locus at which 
divergence occurs. Preferred markers have at least two alleles, each occurring at ftequ^cy 
of greater than 1%, and more preferably greater than 10% oir 20% of a selected populatiorL 
The first idmtified allelic form is arbitrarily designated as the reference form and other allelic 

IS forms are designated as alternative or variant alleles. The allelic £3rm occurring most 

frequently in a selected population is sometimes referred to as the wild-type form, whereas 
allelic forms occurring less frequently are referred to as mutant alleles. Diploid organisms 
may be homozygous or heterozygous for allelic forms. A diallelic polymorphism has two 
forms, a tri-allelic polymorphism has three forms and a tetra-allelic polymorphism has four 

20 forms. 

A "single nucleotide polymorphism" (SNP) occurs at a polymorphic site 
occupied by a single xmcleotide, which is the site of variation between allelic sequences. The 
site is usually preceded by and followed by highly conserved sequences of tfae.allele (e.g., 
sequences that vary in less than 1/100 or 1/1000 members of the populations). A single 

25 nucleotide polymorphism usually arises due to substitution of one nucleotide for anoth^ at 
the polymorphic site. A transition is the replacemmt of one purine by another purine or one 
pyrimidine by another pyrimidine. A transversion is the replacCTient of a purine by a 
pyrimidine or vice versa Single nucleotide poiymorphisms can also arise from a deletion of 
■ a nucleotide or an insertion of a nucleotide relative to a reference'allele. - -^^'^ ^ 

30 The term '^naturally occurring^" as applied to an objectm^ans that the object 

can be found in nature. 
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The tern "subjecr and "individual" are used interchangeably herein to refer 
to any type of organism^ but most typicsSLy is used to refer to a human. 

n. Overview 

Hie present invention provides a variety of methods, compositions and kits for 
determining the identity of a nucleotide present at a variant site in a target nucleic acid The 
methods are based in part i^n tiie une3q>ected finding that a mixture of labelled and 
unlabelled forms fiar a nucleotide can be used in single base pair extension (SBPE) mefliods 
to enhance identification of nucleotides at a site of variation in several respects while 
reducing the costs of analysis. More specifically, the methods of the invention involve 
conducting a template-dependent extension of a prima: whose 3* end is hybridized adjacent 
to a variant site in a target nucleic acid. In some methods, a single base conoplementary to the 
nucleotide occupying a variant site is integrated into the primer and is a non-extendible 
nucleotide whose incorporation prevents further extension of the primer. This non-extendible 
nucleotide is added to the extension reaction as a mixture of labelled and unlabelled forms. 
In certain methods, othOT nucleotides, such as an extendible nucleotide or nucleotide analog, 
can also be added as mixtures of labelled and unlabelled forms. For each nucleotide added as 
a mixture, typically the unlabelled nucleotide is at least 30%-50% of the total concentration 
of the nucleotide the sum of ttie labelled and unlabelled forms for that nucleotide) on a 
molar basis. 

SBPE methods that utilize high purity labelled nucleotides suffer in that the 
fidehty of nucleotide incorporation is negatively affected because labelled nucleotides tend to 
make many polymerases more mot prone (see, eg., Lee, L.Q., et aL, Nucleic Acids Res. 
20:2471-2483 (1992); and Ansorge, W., et al. Methods MoL Biol U.SA. 23:317-356 
(1993)). An example of this phenomenon is illustrated in tiie example infra. As a 
consequence of this loss in fidelity, some SBPE methods have used non-thermostable 
polymerases {e.g,^ Klenow fragment) that are less affected by labelled nucleotides. The use 
of such polymerases, however, significantiy restricts the utiUty of SBPE reactions because the 
use of non-theamostable polymerases precludes the use of thermal cycling reactions to 
increase signal-to-noise ratios. In view of such restrictions, the use of high concentrations of 
high purity labelled nucleotides has been deemed necessary to achieve satisfactory signal-to- 
noise ratios. In addition, certain modified or labelled nucleotides are not readily accepted as 
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substrates by wild type polymerases. To overcome this limitation, some SBPE methods have 
made use of a modified thermostable DNA polymerase (see, ag:, Ross et al.. Nature 
Biotechnology 16: 1347-1351 (1998)). 

The present invention, however, has une3q)ectedly found tiiat by significantly 

5 reducing the concenlratian of the labelled nucleotides fhrou^ the use of mixtures of labelled 
and non-labelled forms of a nucleotide tiiat extension reactions such as SBPE reactions can 
be performed without sacrificing signal-to-noise ratios. In particular, by using mixtures of 
labelled and non-labelled forms of a nucleotide, the methods can utilize Ifae more error prone 
thermostable polymerases witiiout compromising accuracy of nucleotide incorporation, as 

10 well as modified thermostable polymerases (e.g., Taq FS). The ability to utilize fliermostable 
polym«:ases and modified thermostable polymerases specifically selected for incorporating 
nucleotide analogs means that tiie methods of the invention are amenable to thermal cycling 
reactions to increase the amount of extension product formed, thus enabling the methods to 
achieve high signal-to-noise ratios despite using less labelled nucleotide. Hence, with the 

IS methods of the invention, one can obtain good signal-to-noise ratios but with less label and at 
lower cost Additionally, tiie employment of modified thermostable polymerases allows the 
use of a wider selection of labels since such polymerases can function properly witii a wide 
variety of dye types. Other non-modified polymerases are imable to incorporate many 
modified nucleotides. 

20 Moreover, the methods of the invention provide greats flexibility in 

controlling signal intensity which can greatly simplify data analysis, especially for multiplex 
reactions in which multiple polynucleotide targets and/or variant sites are interrogated 
simultaneously. Mediods employing hig^ concentrations of labelled nucleotides are poorly 
suited for regulating signal intensities. For example, in multiplex analyses using nucleotides 

25 that bear different labels, some signals can be quite weak, wh^as other signals can be quite 
strong. Because the methods of the invention utilize mixtures of labelled and unlabelled 
forms for a nucleotide, the relative concentration of different labelled nucleotides can be 
" varied to achieve signal output wherein the signal intensities are more consistent Such 
consistency facilitates datsr dbUectioto £md increases tiie accuracy of the results of data ' 

30 analysis. • 
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A. Qeaeraipescriptjjon 

The methods of fhe invention generally involve the use of mixtOFes of labelled 
and unlabelled forms of at least one nucleotide in conducting primer extension reactions to 
determine tiie identity of a nucleotide at a variant site in a target polynucleotide. Although, a 

5 variety of primer extension methods have been developed, tiie general approach of such 
methods is nonetheless quite similar* In brie^ the methods involve hybridizing a primer that 
is complementary to a target nucleic add such that Ifae 3* end of the primer hybridizes 
adjacent to, but does not span, the variant site of the target nucleic add of interest. The 
hybridization is typically performed in the presence of one or more labelled nucleotides 

1 0 complementary to a nucleotide that potentially occupies the variant site. Hybridization is 
performed under conditions allowing primer extension if a nucleotide complementary to a 
base occupying the variant site in the target nucleic add is present Extension results in the 
incorporation of a labelled nucleotide, thereby generating a labelled extended primer. 
Bxtended primers are detected and provide an indication of which base(s) occupy the site of 

1 S variation in the target polynucleotide. Examples of such primer tension methods include, 
but are not limited to, U.S. Pat. Nos. 5,710,028 to Eyal et al, 5,856,092 to Dale et al., 
5,846,710 to Bajaj et al., 5,888,819 and 6,004,744 both to Goelet eT a/., and PCT publication 
WO 92/16657 to Livak, et al.^ each of which is incorporated by reference in its entirety. The 
methods of the invention are generally applicable to these methods and other related primer 

20 extension methods. 



B. Strand Separation and Annealing 
1. Strand separation 

The various major steps of certain methods of the invention are illustrated in 

25 FIG. 1 . It will be appreciated by tiiose skilled in the art that the identity of the nucleotides 

represented at the variant site in FIG. 1 have been arbitrarily selected and that the nucleotides 

at the variant site could include any of the naturally occurring four bases A, T, G or C). 

The particular variant site shown if FIG. 1 is for an A polymorphic form. 

The methods of the invention begin with, the treatment of a sample that 

30 includes a duplex target nucleic acid to obtain mq)aired nucleotides that at least span fhe 

variant site of interest or, alternatively, to obtain sqparate strands. Of course, if the target 

nucldc add is already single-stranded, such a step is unnecessary. The term 'target nucleic 
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acid*' as used herein refers to single- or double-stranded nucleic acids that include at least one 
of the variant sites being interrogated. For double-stranded nucleic acids, the variant site 
includes the nucleotide at the site being examined and the complementary nucleotide in the 
complementary strand. If a double-stranded target nucleic add is denatured to form two 

S single strands^ each strand can be considered a target nucleic acid and either strand can serve 
as a template in the methods of the invention. 

Strand separation can be achieved using various dmaturing conditions that are 
known in the art including, for example, heat, alkali, fonnamide, urea, glyoxal and 
combmations thereof. Typically, strand separation is achieved using heat denaturation at 

10 temperatures ranging fix>m 80 to about 105 for time p^ods ranging from about 1 to 10 
minutes. Alternatively, single-stranded template can be generated flirough degradation of one 
strand by exonucleases (see, cg.^ Somers et al, Biochimica et Biophysica Acta 1379: 42-52 
(1998); Nikiforov et al, PGR Methods and AppHcations 3: 285-291 (1994); ffiguchi and 
Ochman, Nucleic Acids Research 17: 5865 (1989); and Straus and Zagursky, Biotechniques 

15 10:376-384(1991)). 

2. Armealmj? 

A primer (also referred to as a detection primer) is then amiealed under 
hybridizing conditions to a template strand of the target nucleic acid. The primer is capable 

20 of specifically hybridizing to a segment of the target nucleic acid such that its 3' end is 

adjacent to the variant site on the target nucleic acid (see FIG. 1). As used herein, tiie term 
''adjacent to" when used in refi^ence to hybridization between the primer and target nucleic 
acid typically means diat Ihe primer hybridizes to the target nucleic acid so that its 3' end is 
innnediately 5* to the variant site. However, the 3' end can be located several nucleotides 5' 

25 to the variant site so long as none of ttie nucleotides between the 3' end of the primer and the 
variant site are the same as Ihe nucleotide that potentially occupies the variant site. 

3. Primers 

A variety of different types' of primers can be utilissed'with the present 

30 metiiods. Suitable primers include, for example an oUgodeoxyribonucleotide, an 

oligoribonucleotide, a pq)tide nucleic acid or a copolymer thereof. Primers can be either 

naturally occurring nucleic acids or prepared using synthetic methods. If synthesized, ttie 
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primeis can be synthesized cither enzymatically in vitro, enzymatically in vivo or non- 
enzymatically in vitro. 

Depending upon the nature of the target nucleic acid (see section on samples 
infra) various combinations of primer/target nucleic add diq>leKes can be formed* For 

5 example, in some methods the template is a deoxyribonucleic acid and the primer is an 

oligodeoxyribonucleotide, an oligoribonucleotide, or a copolymer thereo£ la sudi instances, 
a DNA polymerase is utilized to generate a DNA product In certain other methods, tiie 
template is a riboimcleic acid and the primer is an oligodeoxyribonucleotide, an 
oligoribonucleotide, or a copolymer thereof. Reverse transcriptase can be utilized to form a 

10 DNA product In yet other methods, the template is a deoxyribonucleic add and the primer is 
an oligoribonucleotide. Added RNA polymerase can produce an RNA product from such a 
diq>lex. Finally, if the ten^late is a ribonucleic add and the primer an oligoribonucleotide, 
then an RNA replicase can form an RNA product 

Primers are sufficiently long to spedfically hybridize to the appropriate target 

1 5 nucleic acid and to form a stable hybridization complex under the extension reaction 

conditions. Typically, the primes are 15 to 50 nucleotides in length; in other instances, the 
primers are 20 to 30 nucleotides long. The length of the primers can be adjusted to be longer 
or somewhat shorter depending upon the particular sequence to which a primer hybridizes 
ie.g., primers with a high G/C content typically can be shorter than those with a low G/C 

20 content). Most typically, the primos are designed to be perfectly complementary over their 
entire length witii the template strand. However, in certain methods the primers are 
substantially complementary to the target nucldc add; mismatches in such instances, 
however, should not adversely affect the stability of the prinMr/target nucleic acid 
hybridization complex. 

25 In certain methods, the primer can include one or more moieties that allow for 

the afBnity separation of the extension product from unincoiporated reagents and/or target 
nucldc add and/or other nucleic acids in the test san^le. Such attachment moieties are 
described further infra in the section on detection. Alternatively, the primer can include a 
modified nucleotide that fecilitates selective labelling following the extension reaction. 

30 Hence, the term "label** when used in reference to label borne by a primer, or the term 

•labelled primer," or reference to a primer bearing a label includes primers that actually bear 

a label, as well as primers that include modified nucleotides for attachment of label 
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subsequent to extension. Additionally, the prima: can include a label to &cilitat6 detection of 
extended primer. Suitable labels can be selected from those described below in the section on 
labeled micleotides. 

5 C. Extension 

With continued reference to FIG. 1, once the primer has been allowed to 
anneal to a target nucleic acid, the diq>lex of primer and target nucl^c acid is contacted with 
one to four dififermt nucleotides, with at least one nucleotide being added as a mixture of 
labelled and unlabelled forms. Generally, ttie nucleotide chosen is selected to be 

10 complementary to a nucleotide potentially at the variant site. 

The nucleotide(s) included in extension reactions can be any of the naturally 
occurring deoxynucleotides {le. dATP, dGTP, dTTP and dCTP) or derivatives, so long as the 
nucleotide can be incorporated at flie 3 ' end of a primer in a template-dependent fashion. The 
nucleotide can be an extendible nucleotide, a non-extendible nucleotide or a combination of 

1 S extendible and non-extendibie nucleotides. 

An extendible nucleotide refers to nucleotides to which another nucleotide can 
be attached at the 3' position of the sugar (e.g^., the hydroxyl group in the naturally occurring 
deoxynucleotides dATP, dTTP, dCTP and dGTP) during primer extension. A '*non- 
extendible nucleotide" refers to nucleotide analogs that once incorporated into the primer 

20 cannot be extended further, f .e., there is no 3 ' hydroxyl group or the 3' hydroxyl group has 
been modified such that another nucleotide cannot be attached at the 3' position. Thus, 
suitable non-extendible nucleotides include nucleotides in which the 3' hydroxyl group is 
substttuted with a diGferent moiety such lhat another nucleotide cannot be joined to the non- 
extendible nucleotide once incorporated into a primer. Such moieties include, but are not 

25 limited to, -H, -SH and other substituent groups. Specific examples of non-extendible 
nucleotides include dideoxynucleotides and arabinoside triphosphates. 

In some mediods, a deoxynucleotide and a dideoxynucleotide can be used 
togettier, such as in determining the nature of a biallelic variant site via primer extensioii. In 
the presence of one or more edctendiblb nucleotides^ primer extension can go beyond the 

30 variant site depending oh* tlie configuration of the experiment This can be advantageous in 

certain instances as it provides anoth^ criterion for allele calling. For example, if one or 

more nucleotides immediately 3' of the variant site are the same as die nucleotide at tiie 
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variant dte, ejrtaision can proceed 3* of the variant site to include these nucleotides when 
reactions are condiicted with extendible nucleotides complementaiy to the nucleotide at the 
variant site. M this case, different allelic forms can be distinguished on the basis of the 
identity of the label inwwporated into the extension product (assuming differentially labelled 
5 nucleotides are used far diffearent allelic forms), as weU as the size of flie extenaon product. 

When extendible nucleotides are inchidcd, a variety of techniques can be used 
to control the extent of the extension reaction. Such tediniques include, conttolling 
polymerase concentration, limiting extenaon reaction times and conducting extensiwi 
reactions at low taiq)CTatures (ag-., approximately 40 to 60 °C for Taq FS). 
10 Of the nucleotide or nucleotides provided, at least one is provided as a mixture 

of labelled and unlabclled forms. If multiple nucleotides are added to a single reaction, at 
least one nucleotide is provided as a mixture of labelled and unlabelled forms. The 
concentraticm of the unlabelled form for a particular nucleotide relative to the total 
concentration of labeUed and unlabelled forms for that nucleotide is at least 20%, 25%, 30%, 
15 35%, 40%, 45%, 50%, 55% or 60% on a molar basis. The upper limit on the concCTtration of 
tiie unlabeUed forms relative to flie total amount of botii forms is typicaUy 80%, 85%, 90% or 
95% on a molar basis. In certain methods, the unlabelled conceatration typically is 30% to 
95% on a molar basis. In other metiiods, flie unlabelled fonn is genially 50% to 90% on a 
molar basis. In stiU oflier mefliods, ratio of unlabeUed to labelled forms on a molar basis is 
20 1:2. AMiough commercially available labelled nucleotides (e.g., TAMRA-ddATP), can 
sometimes contain relatively low levels of unlabelled forms (typically about 5%), the 
concentration of unlabeUed forms used in the present invention are above such levels and is 
in accordance with the ranges set forth above. 

The nucleotides can be provided according to a variety of different formats. 
25 For exanq»le, one to four nucleotides can be provided as mixtures of labeUed and unlabeUed 
forms witiiin a angle reaction mixture. For example, a sample can be divided into different 
reactions and a different mixture of labeUed and unlabeUed nucleotides added to each 
reaction. Thus, for example, if a variant site includes two aUelic forms, a sample can be 
divided into two different reactions. A mixture containing labeUed and unlabeUed nucleotide 
30 con^lementary to one aUelic form can thrai be added to one reaction mixture and a mixture 
containing labeUed and unlabelled nucleotide complementary to the other aUeUc form added 

to the oflier reaction mixture (e.g., adenosine nucleotide added to one reaction and guanine 
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nucleotide added to tbe other for an A/G polymoiphism). Alternatively, both mixtures can be 
added to a single reaction if the nucleotides are differentially labelled 

Adding a single nucleotide to a reaction mixture ensures that only a single 
type of extension occurs and can in some instances reduce costs. The addition of all 4 
5 nucleotides can in some instances inarease the fidelity of incorporation. Also, tiie addition of 
all 4 nucleotides can allow samples to be more rapidly analyzed by reducing tiie number of 
reactions; 

Further, if multiple nucleotides are added to a single reaction, it is not 
necessary for evay nucleotide to include both labelled and unlabelled forms. Thus, for 

10 example, if all four nucleotides are added to a reaction mixture (e.^., ddATP, ddTTP, ddCTP 
and ddGTP), one, two, three or all four of the nucleotides can be provided as mixtures of 
labelled and unlabelled forms. 

As indicated above, in some instances the 3' end of the primer is several 
nucleotides upstream of the variant site. The nucleotides of the target nucleic acid in this gap 

IS should not include a nucleotide that potentially occupies the variant site being analyzed. In 
such a situation, unlabelled deoxynucleotide triphosphates (dNTPs) compleindltary to th& 
nucleotides between the 3 ' end of the primer and the variant site should be included in the 
extension reactions. These dNTPs are in addition to at least one nucleotide added to the 
reaction mixture that is complementary to a base potentially at the variant site and added as a 

20 mixture of labelled and unlabelled forms. 

A polymerase is also added to the reaction mixture to initiate the incorporation 
of one or more of the nucleotides. If an added nucleotide is con^lementary to the nucleotide 
at the variant site of die target nucleic acid, then the polymerase catalyzes the incotporation 
of this nucleotide to generate an extended primer. Some of the extended primer product is 

25 labelled as a consequence of the incorporation of the labelled nucleotide, whereas other 

extension products lack label because of incorporation of unlabelled nucleotide (see FIG. 1). 
Since the incorporated nucleotide is complemratary to the nucleotide occupying the variant 
site, the incorporated nucleotide provides the basis for identifying the nucleotide(s) at the site 
ofvariktion. " * ' - " .^..^..^^t ^ 

30 Asnotedabove,bypioviding tiienucleotidesasmixtures of labelled 

unlabelled forms, fbe present methods can allow for increased fidelity of nucleotide 

incorporation as conq)ared to methods using primarily labelled nucleotides in which the label 
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can int^ere with incoiporatLon. The enhaaced fidelity of incorporation means that the 
present methods can utilize a wider range of polymerases, including dieimostable and 
modified thermostable polymerases. Methods using primarily labelled non-extendible 
nucleotides, in contrast, often are limited to less error prone polymerases that are often non- 
5 thermostable. 



polymerase" refer to those polymerases whose activity is maintaiaed even when reaction 
mixtures are heated sufficiently to denature duplex nucleic acids to form single-stranded 
nucleic adds for attachment of prim^. Examples of suitable thermostable polymerase 

10 include, but are not limited to, Taq FS (Applied Biosystems), ThennoSequenase 

(Amersham/Phannacia Biochemicals), and Taq (Applied Biosystems). The modified 
thermostable polymerases refer to polymerases that have been modified to have inoreased 
bindmg afSnity for modified nucleotides (e.^^., dideoxynucleotides) and thus readily 
incorporate such nucleotides. The term '^on-themiostable polymerase," in contrast, refers to 

1 5 polymerases which undergo a significant or total loss of activity when heated to achieve 

denaturation of double-stranded nucleic acids hito single-stranded nucleic acids such that die 
primer can hybridize to its primer binding site. Non-thennostable polymerases include, for 
example, DNA polymerase I firom E. coli (i.e., Klenow fiagment), Sequenase (USB), and 
large fragment of Bst Polym^ase (New England BioLabs). 

20 The ability to use thermostable polymerases, especially those developed for 

fluorescent sequencing (e,g.y Taq FS and ThermoSequenase), means that Ifae present methods 
can optionally utilize thermocycling methods to increase the amount of extended primer and 
allows the use of a greater selection of labeled nucleotides. Because of such capabilities, the 
present methods can utilize less labelled material without sacrificing signal-to-noise ratios, 

25 thereby providing strong signals at Iowct cost. As shown in FIG. 1 A, the fiiermoc^cling 
methods involve repeating the strand separation, aimealing and extension reactions with a 
thermocycler to regulate the temperatures at the Expropriate level to promote these steps (see, 
e.g,, Proudfoot, et al^ Science 209:1329-1336 (1980) and Sambrook e( aL, Molecular 
Cloning: A Laboratory Manual, 2nd edition. Cold Spring Harbor Laboratory Press (1989)). 



The terms 'thermostable polymerase" and ''modified diermostable 
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ETctensioii product formation is genoslly detected by detecting the 
incorporation of label into the primer by various direct or indirect methods . In some 
instances, extension product is initially separated Sx>m unreacted reactants in the extension 
reaction niucture before extraision product is detected. With certain techniques, ho wever, 
5 such separation is mmecessary and the methods can be performed in a homogenous format 
When more than one nucleotide is added to extension reactions and each 
nucleotide is added as a mixture of labelled and unlabelled forms, one can control the signal 
intensities for extension products incorporating the different labels by adjusting the relative 
concCTtration of the different labelled nucleotides. This allows one to regulate signal 

1 0 intensities 5uch that signals for differentially labeled extension products are similar. The term 
"similar^* when used to refer to relative signal strengttis means that fhe difference in signal 
intensity for a given labelled extension product relative to the average signal intensity for all 
the extension products formed fix>m the same variant site under similar reaction and detection 
conditions is typically less than 10%, more fi:equentiy less than 30% and most often less than 

15 50%. 

1. Separation Based Methods 

Once extension reactions have been completed, extended primer product is 
typically separated from unextended primers and unincorporated labelled nucleotide to 

20 fiicilitate analysis. Following removal of components Ibat might interfere with the detection 
step, extended primer is analyzed for presence or absence of labeL The labelled nucleotide 
incorporated into a primer is an indicator of the nucleotide that occupies the variant site. 

Separation of the extmded primers from other reaction components can be 
achieved in a variety of ways. In one ^proadi, the prim^ includes an attachment moiety 

25 that is one componmt of an afBnity pair and that allows for afSnity purification of extended 
primer from other components of the extension reaction. Typically, the attachment moiety is 
located at or near the 5' end of the primer. Alternatively, the attachment moiety is connected 
to die nucleotide. The other member of tiie a£5nity pair is iSrequentiy attached to a solid 
support such that extended piruScer b^iarihg label can be'bound to the support via the attached 

30 memb^ of the afBnity pair. Other reaction components can tiioi be washed away. Another 
option besides the use of afSnity purification is to separate extended primer from other 
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reaction components using gel eiectrophoiesis. Yet another option is to selectively inactivate 
the label associated with unincorporated nucleotide. 

A variety of different attachment moieties can be used as part of an afiSnity 
pair to achieve purification of the extended primer from other conq>onents. In general teems, 

5 the attachment moiety^ and the other conq^anent of the afSnity pair include two agmts that are 
capable of specifically binding to one another. Exanq>les of such binding pairs include, but 
are not limited to, polynucleotide^complementary polynucleotide, biotin/avidin, 
antigen/antibody and heavy metal/thiol groiq>. In some instances, one member of tbe afBnity 
pair is attached to a solid si^port. A solution containing (or potentially containing) a primer 

1 0 bearing the complementary member of Ihe a£Bnity pair is then contacted with the s\q>port 
Afi;^ allowing the two components an opportunity to bind and fbrm a complex, other species 
in the extension reaction mixture can be washed from the sipport. 

Thus, for example, in one suitable arrangement, the attachment moiety is a 
polynucleotide that serves as a 5' extension to the prim^. A complementary nucleotide is 

15 attached to a solid support and is cq>able of selectively binding the extension primer. 

Altematively, an antigen functions as the attachment moiety and an antibody specific thereto 
is attached to the support. In yet another anrangement, a thiol group is linked to the primer 
and serves as the attachment moiety. A heavy metal group attached to the solid support can 
be used to selectively bind the thiolated primer. 

20 The attachment moiety can be attached at any point of the primer where it 

does not interfere with the extension reaction. Most typically, the attachment moiety is 
attached at or near tiie 5' end of tiie primer. However, in some instances, the attachment 
moiety is connected to a more internal nucleotide. 

Instead of attaching the attachment moiety to the primer, in some methods the 

25 attachment moiety is part of the nucleotide. The attachmmt moiety can be selected from the 
group of afiSnity pairs described above, for example (see, also, U.S. Pat. No. 5,710,028 to 
Eyal, et aL). Altematively, one can obtain antibodies specific to a fluorescent dye label on 
the nucleotide (e.g., an antibody elicited to fluorescein as a hapten). Such antibodies have 
he&i discussed (see, Voss, B.W., Jr. (Ed.) Fluorescein Hapten: An Immunological Probe). 

30 A variety of different types of supports can be utilized in methods employing 

affinity-binding pairs. Suitable supports include, but are not limited to, beads, microparticles, 

the surfece of a microtiter well, a filter and a glass slide. Similarly, the supports can be 
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fomed from any material Stable to the binding an^ 

example^ glass, polystyrene, cellulose, latex, nitrocellulose, nylon, polyacrylannde, dextran 
and agarose. 

As an alternative to Hie use of afOnity binding pairs, extended primes^ can be 
5 separated from other reaction components by a variety of size based separation techniques 
such as gel electrophoresis and size exclusion chromatography (e.^ HPLQ. hi some 
methods, separation of components by gel electrophoresis and the detection step (typically 
detection of fluorescence from a fluorescent label attached to the extended primer via the 
incorporated nucleotide) is performed using a sii^le integrated instrument, such as the Prizm 
10 DNA Sequencers &om Applied Biosystems, and MegaBACE from Molecular Dynamics. 

2. Homogeneous Assays 

Certain mettiods are conducted in a homogenous assay format in which 
extension products do not need to be separated fiom other extension reaction components 

IS (e.g.^ unextended primer and unincorporated nucleotide). In some methods, this is 

accomplished using donor and accq)tor fluorophofes, including fluorescence resonance 
energy transfer pairs. The fluorophores are chosen so that the emission spectrum of one 
fluoxophore (/.e., tiie donor fluorophore) overlaps flie excitation spectrum of the other 
fluorophore (f.e. , the acceptor fluoxophore). 

20 With such methods, the labelled nucleotide bears one member of the 

donor/acceptor dye pair and the oth^ member is attached to tiie primer. The primer can be 
labelled at a position such ttiat upon the incorporation of the labelled nucleotide the donor 
and acceptor are brought into an energy transfer relationship, wherein fluorescence ensrgy 
can be transferred finm the donor to the acceptor. By measuring fluorescence changes lliat 

25 occur as a consequence of energy transfer (e.g., a decrease in the fluorescence intensity of the 
donor or an increase in the fluorescence intensity of the acceptor), one can detect the 
incorporation of label onto the primer without having to separate extension product fiom 
unreacted reactants. Specific labels suitable for use m such methods are discussed infra in the 
sectidn on labels. Furth^ guidance on such niethods is described in U.S. Patent'No;"*'^'^^ 

30 5,945,283. * 



3. Indirect Methods 
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An altmiative detection xnetliod is to measure the change in the substrate {ie., 
fiee nucleotides) concentrations as an indication of which nucleotide(s) was incorporated into 
the piimer during the extension reactioiL Fluorescence polarization techniques can be 
utilized conveniently for Uns purpose. This technique is able to distinguish between large 
5 and small molecules based on molecular tumbling. Large molecules (e.^., labelled extension 
product) tumble in solution much more slowly than small molecules. Thus, the signal fiom a 
labelled nucleotide incorporated into a primer can be distinguished fiom the labelled 
nucleotide free in solution (see» e.g., Chen et aL^ Genome Research 9:492-8 (1999), and U.S. 
Pat. No. 5,593,867 to Walker et aL^ both of which are incorporated by reference in their 
10 entirety). 

4. Labels 

The labelled nucleotide includes a label that is either directly or indirectly 
detectable. The label covalently attached to the nucleotide, and optionally to the primer, can 

15 be any compound or molecule ttiat can be detected and that does not significantly interfere 
with the extension reaction (eg., interfering sufficiently such that an undetectable amount of 
extension {iroduct is formed and/or causing elevated rates of misincorporation such that an 
accurate determination of the identity of the nucleotide at the variant site is not possible). 
Suitable labels include, but ace not limited to, fluorophores, chromophores, molecules that 

20 emit chemiluminescence, magnetic particles, radioisotopes, mass labels, electron dense 

pardcleSy electrochemically active molecules, enzymes, cofactors, substrates for enzymes and 
ligands having specific binding partn^s (e.g., avidin/biotin). Mass labels can be prepared 
fix)m various monomers. By joining differing numbers of monomers together, mass labels of 
differing molecular weight can be prq>ared. Essentially, any ^e of monomer that can be 

25 joined together and then attached to a primer can be used» so long as the mass label does not 
interfere with hybridization of the primer to the target nucleic add. 

Certain methods utilize fluorescent molecules as the labels, as a number of 
commercial instruments have been developed for tiie detection of fluorescently labelled 
nucleic acids. A variely of fluorescent molecules can be used as labels including, for 

30 sample, fluorescein and fluorescdn derivatives, rhodamine and rhodamine derivatives, 

n^hthylamine and ns^hthylamine derivatives, cyanine and cyanine derivatives, 

benzamidizoles, ethidiums, prqpidiums, anthracyclines, mithramycins, acridines, 
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actinomyciiis, merocyanines, coumarins, pyrenes, cbiysenes, stilbeaes, anthracenes, 
lu^htbalenes, salicyclic acids, braz-2-oxa-l-diazoles (also called benzofuiazans), 
ftuorescamines and bodipy dyes. 

For those methods in which the detection primer and/or the detection product 
5 are labelled with fitiorescent dyes capable of energy transfer to enhance emission intensities 
or sinq)lify the assay, a nmnber of donor (or reporter) and an acc^tor (or quench^) dyes are 
available. One group of donor and acceptor dyes includes the xanthene dyes, such as 
fluorescein dyes, and rhodamine dyes. A variety of derivatives of these dyes are 
commercially available. Often functional groiips are introduced into the phenyl group of 

10 these dyes to serve as a linkage site to an oUgomicleotide. Another general group of dyes 
includes &e n^hthylamines which have an amino group in the a^ha or beta position. Dyes 
of this general type include lKlimethylatxnnonaphthyl--S-sulf(mate, l-anilino-8-naphtfaalene 
sulfonate and 2-p-toluidinyl-6'naphthalene sulfonate. 

Other dyes include 3-phenyl-7-isocyanatocoumarin, acridines, such as 9- 

15 isotfaiocyanatoaoidine and acridine orange, pyrenes, benzoxadiazoles and stilbenes. 

Additional dyes include 3-(8-carboxypentyl)-3'-ethyl-5,5'-dimethyloxa-carbocyanine (CYA); 
6-caifooxy fluorescein (FAM); 5&6-carboxyrhodamine-l 10 (Rl 10); 6-carboxyifaodamine-6G 
CR6G); N',N' J^%N'-tetramefliyl -6-carboxyrhodamine (TAMRA); 6-carboxy-X-riiodamine 
(ROX); 2', 4\ 5% 7% - tetrachloro - 4 - 7 - dichlorofluorescein (TET); 2', 7' - dimethoxy - 

20 4% 5' - 6 carboxyrhodamine (JOE); 6-caiboxy-2'4,4%5',7,7'-hexachlorofluorescein (HE30; 
AIJBXA; Cy3 and Cy5. These dyes are commercially available from various suppliers such 
as Applied Biosyst^ns Division of Ferldn Ehner Corporation (Foster City, CA), Amersham 
Pharmacia Biotech (Piscataway, NT), and Molecular Probes, Inc. (Eugene, OR). 

Further guidance regarding the selection of donor and acceptor pairs that can 

25 effectively be used with the methods of the present invention includes: Fluorescence 
Spectroscopy (Pesce et aL, Eds.) Marcel Dddcer, New York, (1971); White et al.. 
Fluorescence Analysis: A Practical Approach^ Marcel DeUcer, New York, (1970); Berhnan, 
Handbook of Fluorescence Spectra of Aromatic Molecules, 2^ ed:. Academic Press, New 
York, (1971); GrifiBtiis,Ci?/aura/ttf Goto "'^ — 

30 New York, (1976); Indicators OBishop, Ed.). PergamoA Press, Oxford, 19723; and Haugjand, ' " 
Handbook of Fluorescent Probes and Research Chemicals^ Molecular Probes, Eugene 
(1992). 
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IV. MiiltipWrnpr/Poolmg 

The fact tiiat the methods of fhe invention generate labelled products allows 
for the &cile extension of the basic method described above to multiplexing fonnats in ^ch 
5 the identity of a nucleotide at multiple variant sites is detemiined in a single reaction. Such 
foimats allow for rapid sequence determinations in many loci and/or individuals 
simultaneously. The multiple sites can be multiple sites on the same target nucleic acid, such 
sites bdng within the same gene or at sites in difTerent geaes. Alternatively, the multiple 
sites can be the same site on target nucleic acids obtained from different individuals, or 

10 multiple different sites on target nucleic acids from different individuals. 

Ttie multiplexing methods closely parallel the methods just described. 
Primers for each of the different variant sites are annealed to their respective binding sites. 
The general structure of the primers and the methods for conducting the extension reactions is 
as set forth above. Here too, the particular labelled nucleotide(s) incorporated into tiie 

15 extended primer serves to identify the nucleotide(s) present at the variant site. 

In Older to correlate the multiple extension products with tiie various sites, a 
nmnber of different strategies can be utilized to aid ia determining which extension product 
corresponds with which variant site. One option is to differentially label the nucleotides such 
that different labels are used for different variant sites. 

20 Another approach involves using pruners of different lengths so that the 

extension products formed are of differing sizes and capable of bdng distinguished by size 
fiactionation (e.g.» by gel electrophoresis). If this approach is utilized, the primers can dif^ 
by as little as a single nucleotide, although typically the difference in size is larger to 
facilitate the ease by which different extension primers can be distinguished. 

25 Alternatively, primers can be tagged with distinctive identifier tags. A number 

of different tags can be utilized. In some instances, a secondary label is used in conjunction 
with the label used to identify the nucleotide present at a particular variant site. Such 
secondary labels can be any type of molecule or conipound that is detectable (see supra). In 
oth^ instances, the tag is part of an afOnity pair and facilitates separation of the different 

30 extension products one firom the other, in addition to fimctiomng to separate the extended 

primer &om other extension reaction conqionents as described above. Suitable afBnity pairs 

include tiiose described supra in the section on detection methods. 
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The mefliods utilizing primers of dijffereat size or tagged primers can be used 
in conjunction with flie scheme in which different labels are used for difieroit variant sites or 
different nucleotides at a particular variant site. In this way, extension products can be 
identified and distii^;uished fix)m one anoth^ on the basis of two criteria ratiier than simply 

5 one criterion. For example, in some methods, different nucleotides are attached to different 
labels. Further, primers include different tags for different variant sites. The different 
extension products can then be identified and/or separated both according to the different tags 
and the different labels. 

The methods can also be utilized in pooling studies to determine the allele 

10 frequency of a variant site in a study population. Typically, in these type of experiments, flie 
DNA samples &om different individuals are pooled together. Then tiie method of this 
invention can be used to analyze the presence of each allele in the mixed templates. By 
comparing the signal intensities of each allele with a reference set (for example, the 
heterozygotes, the homozygotes or a mixture of both at a known ratio), the prevalence of the 

IS alleles in the population can be determined. (For a general discussion of pooling studies see, 
e.g., Breen G. et al, BioTechniques 28:464-468 (2000); Risch N. and Teng, J., Genome Res. 
8:1273-1288 (1998); Shaw, S.H. et al, GenomerRes. 8:1 11-123 (1998); and Scott, DA. et 
al. Am. J. Hum. Genet. 59:385-391 (1996), each of which is incorporated by reference in its 
entirety). 

20 

V. Genotvping 

A diploid organism contains two copies of each gene. Genotyping iuvolves 
the determination of whether a diploid organism contains two copies of die reference allele (a 
reference-type homozygote), one copy each of the reference and variant allele (i.e., a 

25 het^zygote), or contains two copies of the variant allele (i.e., a variant-type homozygote). 
When conducting a genotyping analysis, the methods of the invmtion can be utilized to 
intarogate a single variant site. However, as described above in the section on 
multiplexing/pooling, the methods can also be used to determine allelic frequency in a group 
of individuals, as well as the genotype of an individual in many differenf DNA loci', eiithSr^^^ 

30 the same gene, different genes or combinations tiiereof. 

Most typically, SNPs consist of two allelic forms, the variant site includes 

one of two different nucleotides. The sanq)le can contain nucleic acids representative of the 
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two copies of the target nucleic acid of interest Analyses can be conducted witti a single 
nucleotide, but more typically labelled nucleotides complementary to both nucleotides 
potentially at the site of variation are utilized. Formation of a siagle labelled extended prim^ 
indicates that the sample is fix>m a homo^gote. The particular labelled nucleotide 

5 incorporated signifies whether the sample is fiom a refereuce-type or variant-type 

homozygote. The formation of two labelled extension products indicates that the sample is 
firom a het^zygote. Reactions can be conducted separately such that each nucleotide is 
added to a different reaction mix, or reactions can be conducted in a single reaction mixture 
containing both nucleotides. If reactions are conducted in a single reaction vessel, the 

10 labelled nucleotides are differentially labelled so that the differmt allelic forms can be 

distinguished. If different reactions are conducted with each labelled nucleotide, the labels 
for each labelled nucleotide can be the same or differmt since the particular nucleotide added 
to each reaction is tracked. 

For polymorphisms Ifaat include more than two allelic forms, additional 

15 labelled nucleotides can be used. For example, for triallelic polymorphisms, three 
differentially labelled nucleotides can be used. In like maimer, with tetra-aUelic 
polymorphisms, four diflferentially labelled nucleotides can be employed. Here, too, all the 
nucleotides can be added to a single reaction mixture or to sq>aral:e reaction mixtures. 
Typically, any additional nucleotides are provided as mixtures of labelled and unlabelled 

20 forms. 

The use of mixtures of labelled and unlabelled forms for any particular 
zmcleotide can significantly enhance data acquisition and analysis during genotyping 
experiments. A problem with methods that use different labels for different nucleotides is 
that some labels are considerably weaker or stronger signal onitters than other labels ia a 

25 particular detection devise. This can be problematic in that if two labels of varying sigpal 
strmgth are used for genotyping a biallelic SNP, heterozygotes can generate two peaks 
centered at different wavelengths two different colors), each having differing heights. 
This can fiiistrate an accurate identification of tiie allelic forms present because 
rmsincorporation, even at a low level, can generate a similar pattern. With the methods of the 

30 present invration, however, similar signal intensities for labels having different signal 

strengths, can be easily achieved simply by altering the ratio of labelled to unlabelled forms 
for each nucleotide. 
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The ability to use tiie methods of the invention to make rapid genotyping 
detemiinations provides a powerfiil tool in genetic analysis and ascertaining the susceptibility 
of an individual to a disease. Lidividuals tiiat are mutant homozygotes for an allele 
associated with a particular disease are at higher risk of having the disease than a 
5 heterozygote or a homozygote for the other allele. The heterozygote, however, is a carrier of 
the allele associated with the disease. Such knowledge can be useful in prenatal and other 
types of medical and genetic counseling, for example. 

VL Samples 

10 A. Types of Target Nucleic Acids 

The methods of the present invention can be utilized to determine the identity 
of a nucleotide at a variety of dififerent types of variant sites including, but not limited to, 
SNPs and mutations such as transitions, transversions, insertions and deletions. Hie presence 
or absence of a target nucleic acid in a sample can be'detected generally as the presence or 

IS absence of a particular nucleotide at a particular site. Individual nucleotides located at a 
particular site can also be identified by the methods described herein. 

Hie methods presented are generally applicable to deoxyribonucleic acids, 
ribonucleic acids, or copolymers thereof. The nucleic acids can be single-stranded or double- 
stranded. The target nucleic acid can include non-naturally occurring nucleotide analogs 

20 including, for example, deoxyinosine or 7-dieasa-2-d6oxyguanosine. Such analogs destabilize 
duplex DNA and allow a primer annealing and extension reaction to occur in double-stranded 
nucleic adds without completely separating the two strands. In some instances, RNA 
samples are first reversed transcribed to form cDNA before use. 

The target nucleic acid can be only a fraction of a larger nucleic acid or can be 

25 present initially as a purified and discrete molecule. Additionally, the target nucleic acid can 
constitute the entire nucleic acid or can be a firaction of a complex mixture of nucleic acids. 
The tai:get nucleic acid can be synthesized enzymatically in vivo, synthesized enzymatically 
m vi^a, or synthesized non-enzymatically. 

30 B. Sources 

The target nucleic acid can be fiom any source. The samples that include the 

target nucleic adds can be natural or synthetic using enzymatic or organic synfliesis 
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techniques. Likewise, the sample caa be taken &om any organism, including but not limited 
to, plants^ miax>organisms ie.g., bact^a, fimgi and viruses), vertebrates, invertebrates and 
mammals (eg., humans, primates, horses, dogs, cows, pigs and sheep). 

For assay of genomic DNA, virtually any biological sample (other than pure 
5 red blood cells) is suitable. Saniples can be obtained from the tissues or fluids of an organism 
or organisms; samples can also be obtained firom cell cultures, tissue homogenates or 
synthesized as described above. For exanq>le, samples can be obtained 6om whole blood, 
serum, semen, saliva, tears, urine, fecal material, sweat, buccal, skin, spinal fluid and hair. 
Samples can also be derived firom in vitro ceU cultures, including the growth medium, 

10 recombinant cells and cell components. For assay of cDNA or mRNA reverse transcribed to 
form cDNA, die tissue sample is obtained firom an organ in which the target nucleic acid is 
expressed. For example, if the targrt nucleic acid is a cytochrome P4S0, the liver is a suitable 
source. Samples for use in prenatal testing can be obtained firom anmiotic fluid. 

The target nucleic acid(s) can also be obtained fiom non-living sources 

1 5 suspected of containing matter firom living organisms. For example, in the instance of 

samples obtained for forensic analysis, the target nucleic acids can be obtained firom samples 
of clothing, furniture, weapons and other items found at a crime scene. 

C. Sample Preparation 

20 Li some instances, the samples contain such a low level of target nucleic adds 

diat it is usefid to conduct a pre-anq)lificatLon reaction to increase the concentration of the 
target nucleic acids. If samples are to be an^lified, amplification is typically conducted 
using the polymerase chain reaction (PGR) according to known procedures. See generally, 
PCR Technology: Principles and Applications forDNA Amplification (H.A. Erlich, Ed.) 

25 Freeman Press, NY, NY (1992); PCR Protocols: A Guide to Methods and Applications 

(Lmis, et al., Eds.) Academic Press, San Diego, CA (1990); Mattila et aL, Nucleic Acids Res. 
19: 4967 (1991); Eckert et al, PCR Methods and Applications 1: 17 (1991); PCR 
(McPhersone/ a/. Ed.), IRL Press, Oxford; and U.S. Patent Nos. 4,683^2 and 4,683,195, 
each of which is incorporated by reference in its entirety. Other suitable amplification 

30 metiiods include: (a) ligase chain reaction (LCR) (see Wu and Wallace, Genomics 4:560 

(1989) and Landegren et al.^ Science 241:1077 (1988)); (b) transcription amplification (Kwoh 
et al.^ Proc. Natl Acad. Sci. USA 86:1173 (1989)); (c) self-sustained sequence replication 
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(Guatelli et aL Proa Nad. Acad. ScL USA, 87:1874 (1990)); and (d) nucleic add based 
sequence amplification (NABSA) (see, Sooknanan, R. and Malek, L., Bio Technology 13: 
563-65 (1995)), each of which is incorporated by reference in its entirety. 

Further guidance regarding nucleic sanq>le prq>aration is described in 
5 Sambrook, et al.. Molecular Cloning: A Laboratory Manual^ 2™* Ed., Cold Spring Harbor 
Laboratoiy Press, (1989), which is incoiporated herein by reference in its entirety. 

m Kits/Compositions 

Compositions and kits for conducting the sequence and genotyping 

10 determinations desoibed herein are also provided by the invention. The compositions 

include mixtures of labelled and imlabelled nucleotides such as those described above. The 
concentration of labelled to unlabelled forms for a nucleotide is as indicated above, but most 
typically the unlabelled form is 50% to 90% of the total concentration of the labelled and 
unlabelled forms as expressed on a molar basis. Although various nucleotides can be utilized 

15 in the kits, most fi:equently the nucleotide is a dideoxynucleotidc. The labelled forms can 
include any of the labels described supra; most typically, however, the label is a fluorophore, 
especially FAM, ROX, TAMRA, RllO, R6G, JOE, TET, HEX, Alexa dyes, Cy3 and Cy 5. 

The compositions of the invention can be combined with other components to 
form kits. Typically, the kits include one or more of the primers. Once hybridized to a target 

20 nucleic acid of interest, the 3* end of the primer is adjacent the variant site of the target 
nucleic acid of interest Often the primers are designed for use in detecting one or more 
SNPs such as those described infra. The number of primers included in the kit can vary. 
Generally, the kits include at least 2, 3, 4 or 5 primers. Other kits can include more primers, 
such as at least 10, 15, 20 or 25 primers. In some kits, the primers can also bear a label or 

25 attachment moiety to facilitate the detection/separation process. 

The kits can include various other components for conducting template- 
dependent extension reactions including, for example, a polymerase, nucleoside 
triphosphates, and buffers. The polymerase can be either a thermostable^pr non-tbeimosteM 
polymerase. Kits can also include the necessary electrophoretic components to size separate 

30 the extension products formed during an analysis. Such components include gel polymers 
(e.g., agarose), polymerizing agents and bufT^. Typically, the kits also include containers 
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for housing fhe various components and iastructions for using the kit conq)onents to conduct 
an analysis. 

VnL Utihtv 

5 The mediods> compositions and kits of the invention are generaUy liseftd for 

determining (he identity of a nucleotide at a variant site. These methods, however, find use m 
a variety of more specific applications. One use is the idmtification and detection of point 
mutations ie.g., somatic point mutations), specifically those mutations coirelated with 
diseases. For sample, the methods desoibed herein are useful for identifying whether a 
10 nucleic acid fix>m a particular subject includes a wild-type allele or a mutant allele at a 

particular SNP site. Furthennoie, the methods can be utilized to establish the g^otype of the 
individual being tested (z.^., distinguish whether the individual is a reference^type 
homozygote, a heterozygote or a variant-type homozygote). 

The genotyping utiUty of the present methods makes them useful within the 
IS context of medical diagnosis and prognosis. Since many SNPs are associated with various 
diseases, clinicians can utilize the results of the genotype study to assess the presence of 
disease, whether an individual is a carrier of disease, the likelihood that an individual will get 
a particular disease and the likely efficacy of various treatment alt^natives. 

The methods also have a variety of non-medical uses. Such utilities include 
20 detecting pathogenic mia^oorganisms, paternity testing and forensic analysis. The methods 
can also be used to identify SNPs in non-humans, including, for example, other animals, 
plants, bact^a and viruses. 

These various uses are described more fully below. 

25 A. Correlation Studies 

Use of the methods of fhe present mvention to acquire diagnostic information 

involves obtaining a sample &om a number of different individuals known to have a common 

disease and conducting screening tests to detormine whether they consistentiy share a 

common genotype at one or more SNP sites. The results of such screening can be used to 

30 establish correlations between certain genotypes and certain diseases. 

In a related fashion, the methods of the invention can be used to develop 

correlations between certain genotypes and patient prognosis. For example, the genotype of a 
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population of individuals sufTering fiom a common disease can be detemiined at one or more 
SNP sites. The health histoiy of flie individuals can be moniloied with time to establish 
correlations between certain genotypes and disease outcomes. 

Hie methods of the invention can also be used to formulate optimal treatment 

5 protocols for a particular disease. For example, the ability of an individual to metabolize 
certain drugs may be associated with a particular genotype(s). The methods described herein 
can be used to place individuals into groups that share a common phenotype and genotype. 
The group can then be subdivided into various groiq>s that each receives different forms of 
treatment. By monitoring the heaMi status of the differmt treatment groiq[>s over time^ the 

10 most effective treatment program for a particular genotype can be established. 

B. Use of Curren t Methods as Screening and Therapeutic Tool 

In. instances in which a correlation between a particular genotype and disease 
state or dmg response have already been established, the methods of the invention can be 

1 S utilized as a diagnostic tool, a prognostic tool and as a means for optimizing treatment. 

For patients having symptoms of a disease, the methods of the present 
invention can be used to detmnine if the patient has a gmotype known to be associated with 
a disease that commonly causes the symptoms the patient exhibits. For example, if the 
genotyping methods of the invention show that the individual has a genotype associated with 

20 a particular disease and further that the genotype is associated with poor recovery a 
variant-type homozygote), the physician can counsel the patient regarding the likely 
effectiveness of aggressive treatment options and the option of sinsply foregoing such 
treatmmts, especially if the disease is quite advanced. On the other hand, if the genotype is 
associated with good recovery, the physician can describe a range of treatment options 

25 varying Scorn sin:q>ly monitoring the disease to see if the condition worsens or more 
aggressive measures to ensure that the disease is attacked before it gets worse. 

The methods of present invention are also valuable for assessing the actual 
risk of an individual known to be susceptible to aoquiiing a disease (e.g., an individual 
coihin^ from a feniily'ttiat hajs a MsISd^^ from a ctisease). By determining whether 

30 the individual is a homozygote for the SNP associated with the disease or a heterozygote, a 
physician can more accurately assess and coimsel the patient regarding the likelihood fbst the 
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patieat will begin suffering from disease, factois involved in triggmng the disease and the 

pros and cons regarding different treatmmt alternatives. 

Similaily, certain methods ofthe invention can also be used to idotitify 

individuals at risk for disease, even though they have no symptoms of disease or no known 
5 susceptibilities to disease. An individual in this category would generally have no disease 

symptoms and have no family history of disease. Jn such cases, the methods ofthe present 

invCTtion can be utilized as a usefol preventive screening tool. Using the methods ofthe 

present invention^ a number of selected SNP sites known to be associated with certain 

diseases can be intenogated to id^tifyfte genotype ofthe individiial at those sites^ If a 
1 0 particular genotype were identified that was known to be associated with a particular disease, 

then a physician could advise the individual regarding the likelihood that the disease would 

manifest itself and the range of treatment options available. 

In yet anoth^ q>plication, the present invention can be useful in formulating 

optimal treatment for a patient. For sample, an individual's response to a drug may be 
1 5 predicted based on his or h&c genotype. The information enables a physician to prescribe the 

most effective medication with minimal side effects. 

C. Kyatnpleg nf Diseases t hat can be Monitored 

A large niunber of diseases have been shown to be correlated wiHi particular 

20 allelic forms of SNPs. A large number of such SNPs are listed in WO 93/02216 and by 
Cooper et aL (Hum. Genet. 85:55-74 (1990)), both of which are incorporated herein by 
reference in their entirety. Specific examples of diseases associated with SNPs include: 
sickle cell anemia and P-thalassemias (mutation in p-globin gene; Antonarakis, New Eng. J. 
Med.y 320:153-163 (1989)), cystic fibrosis (mutation in cystic fibrosis transmembrane 

25 recqitor (CFTR); see Kerem, et aL, Science 245:1073-1080 (1989)), hyperlipoprotememia 
(mutation hi apolipoprotein E gene; see Mahley, Science 240:622-630 (1988)), a wide variety 
of autoimmune diseases (mutations in human major histoconqyatibility complex; see 
Thomson, Ann. Rev. Genet., 22:31-50 (1988); Morel et aL, Proc. NatL Acad. Sci. USA, 
85:81 11-8115 (1988); and Scharf, et aL, Proc. NatL Acad. ScL USA, 85:3504-3508 (1988)) 

30 and the formation of oncogenes (mutations to the human ras-gene family; see, e.g. , Bos et aL, 
Nature, 315:726-730 (1985); Fair et al., Proc. NatL Acad. Scl USA, 85:1629-1633 (1988); 
and Neri, et aL, Proc. NatL Acad. ScL USA, 85:9268-9272 (1988)). Other genes containing 
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SMPs associated with disease include groes encoding for angiotensinogen, angiotensin 
converting razyme, cholesterol ester transfer protein, dopamine receptors, serotonin 
recq)tors, and HIV reverse transcriptase (RT). 

5 D. Other Uses 

The methods described herem can also be used to identify point mutations in 
pathogens that could potentially result in altered pafliogenicity or resistance to certain 
therq)6Utics. The methods can also be used to identify cells and strains having a desired 
genetic constitution for use in various biotechnology applications. The methods described 
10 herein can also detect the presence of somatic mutations that can result in various diseases, 
including cancer for example. 

With knowledge gained from the genotyping methods described herem, 
clinicians can conduct prenatal testing using cells obtained from a fetus to check for a variety 
of inheritable diseases, such as those diseases associated with the SNPs listed above. The 
IS methods can also be used to identify carriers of mutant alleles. Such information can be of 
use by a couple prior to conception as they evaluate the risks of having a child with certain 
birth defects or inheritable diseases. 

Metiiods of the invention can also be utilized in various identification 
applications, such as in the field of forensic medicine or paternal testing. In the case of 
20 foroQLsic analysis, polymorphisms in specific genes can be determined in, for example, blood 
or semen obtained &om a crime scene to indicate whether a particular suspect was involved 
in the crime. In like manner, polymorphism analysis may be utilized in disputes to aid in 
determining whether a particular individual is the parent of a certain child. 

In another ^plication, certain methods of the invention are used in blood 
25 typing or tissue classification. Tissue classifications, for example, can be determined by 
identifying polymorphisms specific for a particular individuaL 

The following examples are provided to illustrate certain aspects of the 
inventidii,*and should not be conistrued in any wayto limit the scope of the invention^/. ^ 

30 ' ' ' : : 

- EXAMPLE 1 

Genotypmg of tiie Chi22.53 SNP, the CYP4.2D6.G4268C SNP 
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and the CYP450.2D6.G1749C SNP 



5 



Materials and Met^pds 

A. Amplific ation of Target Nucleic Acid 

Amplimers were generated througli the PGR (Folymerase-Chain-Reaction) 



process using the HotStart Taq system from Qiagen. The PGR primer bmding sites for the 
three SNPs analyzed. Chr. 22.53 SNP, CYP4.2D6.G4268C, and CYP450.2D6.G1749C SNP 
are shown in FIGS. 2, 5 and 7, respectively. Amplifications were conducted according to the 
manufactmier's recommendations. Primers were obtained from Operon Technologies, Inc. or 
10 synthesized usmg an ABD948 Nucleic Acid Synthesis & Purification Systran. Typically. 40 
cycles of PGR were performed to give a yield of 5-20 ng/(jl DNA. 



addition of exonuclease I and shrimp alkaline phosphatase (USB) at 1 unit/10 pi. After 
proper mixing, the tubes were incubated at 37 for 1 hour followed by 15 minutes at 80 G. 
15 At this stage, a small fraction of each sample was analyzed by agarose gel electrophoresis to 
determine tiie quality and quantity of the PGR products. 



20 a 5-|il reaction containing the following: Ix buffer for Taq FS as recommended by Peddn 
Elmer, 0.2 pM extension primer, 0.25 unit Taq FS (Peikin Elmer), and dye-labelled 
dideoxynucleotide triphosphates (NEN Life Science Products, Inc.) at the concentrations 
indicated in the figure legends. Extension was performed for 30 cycles in a GeneAmp PGR 
System 9700 (Perkdn Elmer). Primer binding sites for the three SNPs studied are shown in 

25 FIGS. 2, 5 and 7. 

C. Analygig of Extension Products 

After desalting via etfaanol precipitation, the san^les were analyzed on a 
sequencing gel using the ABI 377 Sequencer. Results were plotted as fluorescence intensity 
30 versus time. The results shown (see FIGS. 3A, 3B, 4A, 4B, 6A, 6B, 8A and 8B) have been 
corrected for spectrum overlaps of the two dyes used. 



Excess dNTPs and primers were removed from the PGR products by the 



B. 



Primer Extension 

For primer extension, {^proximately 10 ng of each PGR amplimer was used in 



36 



wo 01/94546 



PCT/USOl/17928 



n. Chr.22.S3 Genotvping Results 

A series of genotyping ^^iments fi3r the Chr. 22.53 SnP (see FIG. 2) were 
conducted to compare the accuracy of genotyping results and the ability to control signal 
5 intensity with methods using substantially pure labelled fimns of nucleotides and methods 
using mixtures of labelled and unlabelled forms of nucleotides. As used to refer to the purity 
of labelled nucleotides, the term ''substantially pure" means that the unlabelled form of the 
nucleotide was only present at the trace levels found in commercially available products. 
Typically, unlabelled forms are approximately 5% of the total labelled and unlabelled 
10 nucleotides on a molar basis. 

A. Rl 10 and TAMRA Labeled Nucleotides 

A first experiment was conducted using substantially pure labelled 
nucleotides. FIG. 3 A shows electropherograms as plots of fluorescence intensity versus time 

1 5 for this set of genotyping e^eriments conducted with 7 different individuals at the Chr.22.53 
SNP site. E3q>eriments w^e conducted using 200 nM TAMRA-ddATP and 100 nM Rl 10- 
ddGTP. Signals fiom extension products bearing the label Rl 10 generated fix>m the G allele 
are indicated by traces labelled G. Fluorescent signals for extension products bearing the 
label TAMRA. generated fiom the A allele are shown by traces labelled A). The genotypes, 

20 as listed on the left side of each panel, were confirmed by sequencing the an^^limers directiy. 
As can be seen in this figure, tiie genotype was correctly identified for each of the 7 
individuals. However, the signal intensities for the different alleles differ. Significant 
differences in signal intensities can complicate computerized analysis of signal traces. For 
example, in the G/A het^ozygotes, the intensity of the A peak is 30-50% of the G peak. 

25 Small A peak is also observable in the G/G homozygote. An unsophisticated auto-allele- 
calling program may not be able to make accurate genotype determination in certain 
instances. 

A second set of genotyping experiments were conducted with the same 

individuals aM at the same SNP sitett^^^ 

30 nucleotides. More specifically, extension reactions were conducted in Ihe presence of a 

mixture of 15 nM Rl lO-ddGTP/85 nM ddGTP and a mixture of 30 nM TAMRA^dATP/70 

nM ddATP. Results showing extension products formed for different individuals is 
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summarized in the electropherograms shown m FIG. 3B. Extension product bearing the 
Rl 10 label is indicated by traces labelled G and repr^ents extension product formed fixmi the 
G allele. Extension product bearing tiie TAMRA label is represented by traces labelled A 
and corresponds to tiie A allele. The correct genotypes are listed on the left side of the 
5 panels. Comparison of HQ. 3A and FIG. 3B indicate that equitable genotyping results are 
obtained in the presence of a mixture of labelled and unlabelled nucleotides. This 
comparison also demonstrates the ability to regulate the relative concentrations of the labelled 
nucleotides such that the signal intensities for different extension products is sunilar. 

10 B. Rl 10 and R6G Labeled Nucleotides 

A third ^eriment with the same 7 individuals and SNP site was conducted 
using substantially pure labelled nucleotides. However, instead of using Rl 10 and TAMRA 
labelled nucleotides as wilh the preceding experiments^ RUO and R6G labelled nucleotides 
were utilized. Experiments were conducted using 100 nM R6G-ddATP and 1 00 nM Rl 10- 

15 ddGTP. Electropherograms of the extension products formed for the 7 individuals is shown 
in FIG. 4A. Signals from extension products bearing the R6G label generated from tiie A 
allele are represented by traces labelled A. Extension products bearing the Rl 10 label 
formed from the G allele are represented by traces labelled G. The correct genotypes, as 
confirmed by sequencing, are listed on the left of the panels. As can be seen in this figure, 

20 Rl 10-ddGTP was misincorporated into individuals that were "AA" homozygotes. 

A fourth genotyping ^eriment similar to Qie third experiment was conducted 
but with a mixture of labelled and unlabelled nucleotides. In particular, experiments were 
performed using amixture of 15 nM RllO-ddGTP/85 nM ddGTP and a mixture of 15 nM 
R6G-ddATP/85 nM ddATP. Results of llie extension reactions are shown in FIG. 4B. 

25 Extension products labelled with RllO (corresponding to product &om the G allele) are 

indicated by the traces labelled G. Extension products labelled with R6G (corresponding to 
product fix>m the A allele) are represented by tiie traces labelled A. The correct genotypes are 
listed on the left of the traces. As a comparison of HG. 4A and 4B illustrates, Rl lO-ddG 
misincorporation into the '*AA*' homozygotes was significantiy suppressed using mixtures of 

30 labelled and unlabelled nucleotides as compared to methods using substantially pure labelled 
nucleotides. 



38 



wo 01/94546 PCT/USOl/17928 



m. CYP4,2D6>G4 268C Genotvping Results 

Another set of genotyping experimeDts were conducted with 7 individuals at 
the CYP4.2D6,G4268C SNP (Sachse ^a/., Am. J. Hum. Genet. 6:284-295 (1997), which is 
incorporated by refi^ence in its CTtirety). The sequaice suirounding this SNP is shown in 
5 FIG. 5. This experiment investigated the ability to utilize mixtures of labelled and unlabelled 
tarms of a deo^goiucleotide in combination with a dideoxynucleotide. 

A. Combination of Dideoxvnucleotides 

One experimCTt was conducted using a mixture of labelled and unlabelled 
10 nucleotides, specifically 10 nM Rl 10-ddCTP/ 90 nM ddCTP and a mixture of 30 nM 

TAMRA-ddGTP/ 70 nM ddGTP. FIG. 6A presents the electropherograms obtained for the 
different mdividuals. Signals fi-om extension products incoiporatmg Rl 10 label and 
corresponding to the C allele are represented by the traces labelled with a C. Extension 
products labelled with TAMRA and corresponding to the G allele are rqjresented by traces 
IS labelled G. The genotypes, as indicated in parentheses, were also confirmed by sequencing 
the aniplimers directly. 

B. Combination of Dideoxynucleotide and Deoxvnucleotide 

Another experiment was conducted using mixtures of labelled and unlabelled 
20 forms of a dideoxynucleotide and a deoxynucleotide. Experiments were conducted with a 

mixture of 10 nM RllO-dCTP/90 nM dCTP and mixtures of 30 nM TAMRA-ddGTP/70 nM 

ddGTP. FIG. 6B d^icts the electropherograms obtained for each of flie 7 individuals tested. 

Extension products incorporating Rl 10 and generated fiom the C allele are represented by . 

traces labelled C. Extension products labelled with TAMRA and generated firom the G allele 
25 are represented by traces labelled G. The correct genotypes are as indicated in parentheses. 

A comparison of FIG. 6 A and 6B demonstrates that a deoxynucleotide can also be used in the 

primer ext^tision reaction to obtain the correct gmotypes. 

W: eYP450.2D6,G1749CSNPGmotvDing-R^l^^^ : 

30 A. ' Labelled Nucleotides 

One set of genotyping e}q)eriments was conducted with 7 mdividuals at the 

CYP450.2D6.G1749C SNP (Sachse et al.. Am. J. Hum. Genet. 6:284-295 (1997), which is 
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incorporated by reference in its entirety). The sequence surrounding this SNP is shoiTvn in 
FIG. 7. 

A first experimmt was conducted using substantially pure labelled 
nucleotides. FIG. 8A shows electropherograms as plots of fluorescence intensity versus time 
S for this set of genotyping experiments conducted with 7 different individuals at the 

CYP450.2D6.G1749C site. Experiments were conducted using 100 nM Rl lO^ldCTP and 
1 00 nM R6G-ddGTP. Signals firom extension products bearing the label RllO generated 
&om the C allele are indicated by traces labelled C. Fluorescent signals for extension 
products bearing the label R6G g^erated firom the G allele are shown by traces labelled G. 
1 0 The genotypes, as listed on the left side of each panel, were confirmed by sequencing the 
amplimers directly. As can be seen in FIG. 8 A, Ifae genot3^e was correctly id^xtified for 
each of the 7 uidividuals. 

B. Mixture of Labelled and Unlabelled Nucleotides 
15 A second set of genotyping experiments was conducted with the same 

individuals and at the CYP4.2D6.G1749C SNP site but using a mixture of labelled and 
TUilabelled nucleotides. More specifically, extension reactions were conducted in the 
presence of a mixture of 10 nM RllO-ddCTP/90 nM ddCTP and a mixture of 10 nM 
TAMRA-ddGTP/90 nM ddGTP. Results showing extension products formed for different 
20 individuals is summarized in the electropherograms shown in FIG. 8B. Extension product 
bearing the Rl 1 0 label is indicated by traces labelled C and represents extrasion product 
formed fcom the C allele. Ext^ion product bearing the R6G label is represented by traces 
labelled G and corresponds to the G allele. The correct genotypes are listed on the left side of 
the panels. 

25 A comparison of FIG. 8A and FIG. SB d^ionstrates that the results obtained 

with a mixture of labeled and unlabeled nucleotides is consistent with .the results obtained 
using just labelled nucleotides. 

It is understood that. the examples and embodinients described herein are for 

30 illustrative piuposes only and that various modifications or changes in lig^t thereof will be 

suggested to persons ^led in the art and are to be included within the spirit and purview of 

this application and scope of the appended claims. All publications, patents and patent 
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applications cited herein are hereby incorporated by reference in their entirety for all 
purposes to the same extent as if each individual publication, patent or patent s^plicadon 
were specifically and individually indicated to be so incorporated by reference. 
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WHAT IS CLAIMED IS: 



1 



A method of analyziiig a site of variation in a target polynucleotide^ 



2 



coiiq>iising: 



3 



(a) 



contacting a sample of the polynucleotide with multiple copies of a 



4 primer that hybridizes adj aceut to but not including the site of variation in the presence of a 

5 mixture of labelled and unlabelled forms of a nucleotide, wherein the unlabelled form is 20% 

6 to 95% of the total amount of the labelled and unlabelled forms on a molar basis, the 

7 contacting being performed under conditions whereby if the nucleotide is complementary to 

8 the base occupying the site of variation in the target polynucleotide, at least one copy of the 

9 primer is extended by incorporation of the labelled nucleotide; and 

10 (b) detecting incorporation of the labeled nucleotide into the primer as an 

1 1 indication of the base occupying the site of yariation in the target polynucleotide. 

1 2. The method of claim 1 , wherein the labelled and unlabelled forms of a 

2 nucleotide are labelled and unlabelled forzos of a non-extendible nucleotide. 

1 3 . The method of claim 1, wherein the non-extendible nucleotide is a 

2 dideoxynucleotide or an arabinoside triphosphate. 

1 4. The method of claim 1, wherein the labelled and unlabelled forms of a 

2 nucleotide are labelled and unlabelled forms of an extendible nucleotide. 

1 5. The method of claim 1^ wherein the unlabelled form is 50% to 90% of 

2 the total amount of labelled and unlabelled forms on a molar basis. 

1 6. The method of claim 1 , furttier comprising mixing imlabelled and 

2 labelled forms of nucleotide ficom separate sources. 

1 7. The method ofclaiml, wherein the conditions comprise pres^ce of a 

2 thermostable polymerase* and performing multiple cycles of denaturation» annealing and 

3 primer extensioiL 



1 
2 



8. The metiiod of claim 7, wherein the thermostable polymerase is 
selected from the group consisting of Taq, Taq FS and ThermoSequenase. 
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1 9. The method of claim 1, wherein the conditions comprise presence of a 

2 non-thermostable polymerase and performing a single cycle of primer extension. 

1 10. The method of claim 9, wherein the non-fheanostable polymerase is 

2 selected from the groiq) consisting of Klenow fragment, Bst Polymerase (large fragment) and 

3 Sequenase. 

1 11. The method of claim 1, wherein the label is selected from the group 

2 consisting of a fluorqphore, a diromophore, a radioisotope, an enzyme substrate, an electron 

3 dense agent, a magnetic particle, an eltetrochemically active moiety and a mass label. 

1 12. The method according to claim 1 1, wherein the label is a fluorescent 

2 dye. 

1 13. The method according to claim 12, wherein the fluorescent dye is 

2 selected from the group consisting of FAM, ROX, TAMRA. Rl 10, R6G, Joe, TET, HEX, 

3 Alexa, Cy3 and Cy 5. 

1 14. The method of claim 1 , wherein the multiple copies of primer are 

2 labeled with a primer label different from the label home by the labelled nucleotide, whereby 

3 if the nucleotide is complementary to the base occup3dng the site of variation, then at least 

4 one copy of extended primer bears labels from both labelled primer and labelled nucleotide. 

1 IS. The method of claim 14, wherem the primer label and the label borne 

2 by die labelled nucleotide comprise a donor fluorophore and an acceptor fluorophore. 

1 16. The method of claim 1 , wherein detection of labelled nucleotide 

2 incorporation is performed by fluorescence polarization. 

1 17. A method of analyzing a site of variation in a target polynucleotide, 

2 comprising: 

3 (a) contacting a sample of the polynucleotide with multiple copies of V 

4 primer that hybridizes adjacent to but not including the site of variation in the presence of a 

5 plurality of mixtures of labelled and unlabelled forms of a nucleotide, each mixture 
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6 containing a different nucleotide^ for each nucleotide the unlabelled form being 20% to 95% 

7 of the combined amount of the labelled and imlabelled forms on a molar basis, and the 

8 contacting step being performed under conditions whereby if the nucleotide of a mixture is 

9 conq)lementary to the base occxq>ying the site of variation in the target polynucleotide, at 

10 least one copy of the primer is extended by incorporation of the labelled nucleotide; and 

11 (b) detecting the incorporation of labelled nucleotide into the primer(s) 

12 extended by the labelled nucleotide(5) as an indication of the base(s) occupying the site of 

13 variation in the target polynucleotide, 

1 18. The method of claim 17, wherein each of the plurality of mixtures 

2 comprises a nuxture of labelled and unlabelled non-extendible nucleotides. 

1 19. The method of claim 17, wherem eadi of the plurality of mixtures 

2 comprises a mixture of labelled and unlabelled extendible nucleotides. 

1 20. The method of claim 1 8, wherein the non-extendible nucleotides are 

2 selected from the group consisting of a dideoxynucleotide, and an arabinoside triphosphate. 

1 21. The method of claim 17, wherein for each of the plurality of mixtures 

2 the unlabelled form is 50% to 90% of the total amount of labelled and unlabelled forms on a 

3 molar basis. 

1 22. The method of claim 17, whereia the plurality of mixtures is 2, 3 or 4 

2 mixtures. 

1 23 . The method of claim 17, wherein the contacting step comprises 

2 dividing the sample into a ptoality of separate samples such that the numb^ of samples is 

3 equivalent to the number of mixtures of nucleotides and contacting each of the samples with 

4 a different mixture of nucleotides. 

1 24. The method of claim 17, whmin the plurality of mixtures is a first 

2 mixture of labelled and unlabelled forms of a first nucleotide and a second mixture of 

3 labelled and unlabelled forms of a second nucleotide. 
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1 25. The method of 24, wherein the molar ratio of UBlabelled to labelled 

2 forms in the first mixture is at least 1:2 and the ratio of the imlabelled to labelled forms in the 

3 second mixture is at least 1 :2. 

1 26. The method of claim 17, wherein the sample of the target 

2 polynucleotide is a heterozygous sample comprising first and second target polynucleotides 

3 differing at the site of variation, and at least one copy of primer hybridizing to the first target 

4 polynucleotide is extended by incorporation of the first labelled nucleotide, and at least one 

5 copy of primer hybridizing to the second target polynucleotide is extended by incorporation 

6 of the second labelled nucleotide. 

1 27. The method of claim 17, wherein the target polynucleotide is fix>m a 

2 diploid subject and con:q)rises a first and/or second target polynucleotide that differ at the site 

3 of variation, whereby if tiie first target polynucleotide is present, at least one copy of prim^ 

4 hybridizes to the first target polynucleotide and is tended by incorporation of the first 

5 labelled nucleotide, and if the second target polynucleotide is present, at least one copy of 

6 primer hybridizes to the second target polynucleotide and is extended by incorporation of the 

7 second labelled nucleotide. 

1 28. The method of claim 27, wherein the detecting step comprises 

2 detecting the presence of primers extraded by the first and/or second labelled nucleotide, the 

3 presence of primers extended only by the incorporation of first labelled nucleotide or second 

4 labelled nucleotide indicating that the subject is a homozygote, whereas the presence of 

5 primers extended by the incorporation of first labelled nucleotide and primers extended by 

6 second labelled nucleotide indicates that the subject is a heterozygote. 

1 29. The method of claim 24, wherein the detecting step comprises 

2 detecting a first signal fix>m prim^ extended by the first labelled nucleotide and a second 

3 signal Scorn primer extended by the second labelled nucleotide, and the molar ratio of the first 

4 and second labeled nuicleotides is suGh'that!^h6'^first atid-second signals are of similar strength. 



45 





wo 01/94546 



PCT/USOl/17928 



1 



30. The method of claim 1 7, wherein the conditions include presence of a 



2 themiostable polymerase, and perfonning multiple cycles of denaturation, annealing and 

3 primer extension. 



2 comprising contacting the target polynucleotide with a primer hybridizing adjacent to but not 

3 including ibe site of variation and a labelled nucleotide imder conditions in which the primer 

4 is extended by the labelled nucleotide if the labelled nucleotide is complementary to the site 

5 of variation, the improvement wherein the target polynucleotide is contacted with a mixture 

6 of labelled and unlabelled forms of the nucleotide and the unlabelled form is at least 30% of 

7 the combined amoimt of the labelled and unlabelled forms on a molar basis. 

1 32. A method for analyzing sites of variation in target polynucleotides, 

2 comprising: 

3 (a) conducting a plurality of template-dependent extension reactions with 

4 a plurality of different primers and one or more mixtures of labelled and unlabelled forms of 

5 a nucleotide, wherein different primers hybridize adjacent different sites of variation on target 

6 polynucleotides and different mixtures contain different nucleotides, the unlabelled form in 

7 each mixture being 20% to 90% of the combined amount of the labelled and unlabelled forms 

8 on a molar basis, each extension reaction comprising 

9 (i) contacting a sample of the target polynucleotides with multiple 

1 0 copies of one of the different primers, wherein the primer bears an optional label and tiie 3 ' 

1 1 end of the prim^ hybridizes adjacent to but not including the site of variation in the target 

12 polynucleotide, and 

1 3 (ii) exposing the copies of the primer to one or more of the 

14 mixtures of labelled and unlabelled nucleotides, under conditions such that if the nucleotide 

15 of the mixture is complementary to the base occupjdng the site of variation in fhe target 

16 polynucleotide, at least one copy of the primer is extended by incorporation of the labelled 

17 nucleotide, 

1 8 whereby the extension reactions generate a plurality of different extension 

19 products, extension products generated fiom different sites of variation being distinguishable 



1 



31. In a method of analyzing a site of variation in a target polynucleotide 



46 



wo 01/94546 



PCT/USOl/17928 



20 on the basis of fhe optional label borne by the extended prima: and/or label borne by 

21 incoiporated nucleotide; and 

22 (b) detecting the incoiporationoflabelled nucleotides into the extended 

23 piimers as an indication of ttie bases occupying Ihe site of vaxiation in the target 

24 polynucleotides. 

1 33. The method of claim 32, wherein the optional label is selected from the 

2 group consisting of a fluorophore, a chromophore, a radioisotope, an enzyme substrate, an 

3 electron dense agent, a magnetic particle, an electrochemicaUy active moiety and a mass 

4 labeL 

1 34. The method of claim 32, wherein the different sites of variation are 

2 different sites on the same target polynucleotide or same site on different polynucleotides and 

3 the extension reactions are conducted in a single reaction vesseL 

1 35. The method of claim 32, further comprising separating the different 

2 extension products prior to detecting incotporation of labelled nucleotide, separation being 

3 accomplished by electrophoresis or high pressure liquid chromatogn^hy. 

1 36. The method of claim 32, wherein 

2 (i) the miiltiple copies of primer are labeled with a primer label, whereby 

3 extension products bear two labels, a primer label and label from incorporated labelled 

4 nucleotide, the two labels comprising a donor fluorophore and an acceptor fluorophore; and 

5 (ii) detecting comprises detecting changes in fluorescence of the donor 

6 and/or acceptor fluorophores. 

1 37. The method of claim 32, wherein detection of labelled nucleotide 

2 incorporation is performed by fluorescence polarizatioi;!. 

1 38. The method of claim 32, wherein the different sites of variation are 

2 located on different target polynucleotides and each extension reaction is conducted in a 

3 separate reaction vessel, and further comprising collecting extension product from the 

4 reaction vessels prior to detection. 
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1 39. A composition comprising a mixture of labelled and milabeUed fonns 

2 of a nucleotide, wherein the unlabelled fomi is 30% to 95% of the combined amount of 

3 labelled and unlabelled forms on a molar basis. 

1 40. The composition of claim 39, herein the unlabelled nucleotide is a 

2 non-extendible nucleotide. 

1 41. The composition of claim 39, wherein the unlabelled nucleotide is an 

2 extendible nucleotide. 

1 42. The composition of claim 39, wherein the label borne by the labelled 

2 nucleotide is a fluorophore. 

1 43. The composition of claim 42, wherein the fluor€>phore is selected Scam 

2 the gtoiq> consisting of FAM, ROX, TAMRA, Rl 10, R6G, Joe, HEX, TET, Alexa, Cy3 and 

3 Cy5- 

1 44. A kit for analyzing a variant site in a target nucleic acid, comprising: 

2 (a) a mixture of labelled and unlabelled nucleotide, the unlabelled 

3 nucleotide being 30% to 95% of the combined amount of unlabelled and labelled forms of the 

4 nucleotide on a molar basis; 

5 (b) a polymerase capable of catalj^zing template-dependent extension of a 

6 primer. 

7 (c) a primer that hybridizes to a segment of the target nucleic acid, 

8 wherein the 3 * end of the primer hybridizes adjacent to the site of variation in the target 

9 nucleic acid and bears an optional label. 

1 45. The kit of claim 44, wherdn the label bome by the labelled nucleotide 

2 is a fluorophore. 

1 46. The kit of claim 45, wherein the fluorophore is selected from the group 

2 consisting of FAM, ROX, TAMRA, Rl 10, R6G, Joe, HEX, TET, Alexa, Cy3 and Cy 5. 
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1 47. Tlieldtofclaim 44, iTv^erein the polymerase is a them 

2 polymerase. 

1 48 . The kit of claim 44, wherein the 3 ' end of the primer hybridizes 

2 iimnediately adjacent to the site of variation in the target nucleic acid. 

1 49. The kit of claim 48, wherein tiie primer is at least S primers, each 

2 primer hybridizing adj acent to a different variant site in the target nucleic acid. 

1 50. The kit of claim 49, wherein each variant site is a polymorphic site 

2 correlated with a disease. 
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aaggcaagtg acctggcaaa ggcctcccag 



catatatggt cacaggtaaa 



actcaagcct ggtctgttat tccctcccca aaactgcctg gtagg gagaa 
gggaaagcca gcctc (R) gctgtgactg • actcctggct atctggagct 
ctccctccag attcctttct acctctctgg ctgtcacccc agtgctaaga 
gcctgggctc aggccctcct ctctctaccc tggctgctcc cgctcattcc 
tgtggcttct gtccccagta ctctagca 



FIG.2 
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